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PREFACE 


It is hoped that these notes will be of assistance to the designer who requires mem- 
ory function in his system. The variety of semiconductor memory products is 
already quite extensive and is growing rapidly. This booklet has attempted to 
give some understanding of the characteristics of the types of memory now avail- 
able and some hints about how to solve a few system design problems, However, 
the variety of applications is so broad that is has been possible only to very briefly 


touch upon a few of them. Nae CW pbe~ 


Marcian Hoff 


INTRODUCTION 


Memory function is one of the most impor- 
tant elements of a digital computer. Large 
computer systems often contain many types 
of memories: reels of tape and drawers of 
cards, disk cartridges, ferrite core or semi- 
conductor main storage, fast registers, etc. 
Each of these different types of memory 
offers a means for storing and retrieving data 
(and programs which are also “‘data’’) for use 
by the computer system, yet they differ in 
cost, organization, retrieval time, etc. 


Memory function is also important to many 
other systems which utilize digital processing. 
Digital filters, computer terminals, electronic 
cash registers, traffic-light controllers — all 
may require data storage to achieve their in- 
tended function. 


Today there are many types of memory 
available to the designer. One of the most 
recent additions to the repertory of mem- 
ory technologies is the semiconductor mem- 
ory. In fact, there are many new semicon- 
ductor memory components, which utilize 
different technologies, and offer different 
organizations, operating characteristics, etc. 


The purpose of this booklet is to offer the 
designer some basic information about the 
types of semiconductor memory available 
and how to utilize them effectively. 


The booklet is divided into six parts. Part 1 
describes basic random access, fixed-address 
memories (RAMs). This type of memory 
organization is most commonly used as the 
main memory of general purpose digital com- 
puters, and is also used to realize fast registers 
or scratch-pad memories, many types of buf- 
fer memories, etc. Part 2 discusses Read Only 
Memories (ROMs) and Programmable Read 
Only Memories (PROMs) which are also al- 
most always random-access, fixed-address 
memories as well. ROMs/PROMs are used for 
data tables, control units, as the main mem- 
ory of certain dedicated special purpose com- 
puters, and to replace general random logic. 
Part 3 discusses serially addressed memory 
(shift registers). Part 4 discusses memory 
organizations other than fixed address, such 
as Content Addressable Memory (CAMs), buf- 
fered memories, virtual memory, etc. Part 5 
discusses Binary Coded Decimal (BCD) Ad- 
dressing techniques. Part 6 includes selected 
article reprints of process technology and de- 
vice applications. 
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INTRODUCTION 


As defined here, a random access memory is one in which 
the writing time and retrieval (access or cycle) times are 
essentially independent of the location into which the 
data is entered. By fixed address, it is meant that data is 
entered into a known, pre-determined location, and is re- 
trieved from that same location. In a fixed address RAM 
the data does not move around in the memory except by 
external control. 


For many years, there has been just one practical way to_ 
build a random access high speed memory — use ferrite 

cores. But suddenly, the designer of memory systems is con- 

fronted with a proliferation of new semiconductor memory 

devices and a host of new rules and considerations to face. 

The designer who is now in that position, facing a pile of 
data sheets, and wondering where to start, shouldn’t give up 

hope. Semiconductor memories are easy to use and offer 

high performance. Most important of all, some of the 

newest semiconductor components cost less per bit than 

core memory. 


First consider some of the types of RAM building blocks 
now available. These can be first divided into three basic 
categories, related to the technology and circuit techniques 
used for their implementation: 


1. Bipolar 


Bipolar technology is most familiar to the logic designer in 
the form of RTL, TTL, ECL, and other high performance 
low voltage circuit families. Bipolar circuits are so called 
because the technology primarily realizes npn bipolar junc- 
tion transistors, Other components which can be integrated 
include resistors and junction diodes. A more recently 
developed bipolar technology, the Schottky technology, 
also allows Schottky diodes, and substrate and lateral pnp 
transistors. 


Bipolar semiconductor memories offer high performance 
and are usually directly compatible with one or more of the. 
standard bipolar logic families. Presently available are bi- 
polar memories of up to 256 bits per chip. When com- 
pared to MOS memories, bipolar units are usually faster, 
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but cost more and usually dissipate more power. 


2. Static MOS 


Both static and dynamic MOS circuits use insulated gate 
field effect transistors as the primary circuit element. A 
few circuits may also include diffused resistors. However, 
in static circuits, the devices are used as if they were DC 
amplifiers while in dynamic circuit extensive use is made 
of capacitances for temporary storage. 


In general, static MOS circuits are easier to use in systems 
than dynamic MOS circuits, but require more power and 
cost more per circuit function. With the availability of newer 
(low threshold voltage) MOS technology, many static MOS 
circuits can be directly interfaced to TTL or DTL logic 
circuits, 

Static MOS designs are available in both n and p channel 
technologies. The n channel MOS technology offers even 
easier interface to TTL than the p channel technology. 
Static MOS memories are now available with up to 1024 
bits per chip. These memories are usually slower than bi- 
pular memories of a Siitiilar size Dy a factor of 5 to 10, bit 
cost less because of the inherently higher yield and smaller 
size of the MOS devices. 


3. Dynamic MOS 


Dynamic MOS circuits make use of temporary storage of 
data on the parasitic capacitances within the circuit. Due 
to leakages associated with junctions within the circuit, the 
charge on these capacitances may leak off in a few milli- 
seconds. To prevent loss of data, this charge must be re- 
stored (refreshed). Restoration of the charge is achieved by 
regenerating or recirculating the data. Dynamic circuits 
usually require externally generated clock voltages. These 
. clock voltages are usually too high to be generated with 
ordinary integrated circuits, so special ICs or discrete circuit 
components must often be used. As a result, dynamic cir- 
cuits are usually more difficult to interface to standard bi- 
polar logic families than the other types of circuits described 
above. 


RAMs 
Although more difficult to use, dynamic MOS circuits 
offer some very significant advantages. The chip area re- 
quired per unit function is much smaller than for either 
bipolar or static MOS. This high functional density com- 
bined with the higher yield or silicon-gate MOS technology, 
makes the dynamic MOS circuit the most economical of the 
three approaches, and permits large amounts of memory to 
be integrated on a single chip. 1024 bit dynamic MOS 
memories have been available for some time and larger 
units are in development. Speeds possible with dynamic 
MOS circuits are much faster than for static MOS, although 
usually not quite as fast as for bipolar circuits. Thus, be- 
cause high functional density can be combined with high 
performance, dynamic MOS circuits are potentially the 
strongest contenders for use in large random access memory 
systems. Dynamic MOS circuits also offer the lowest power 
dissipation per function of the three basic approaches, In 
many cases, such circuits are designed so that they dissipate 
power only during access or data regeneration operations. 


These properties are summarized in Table 1.1 below. Sam- 
ples of different types of semiconductor memory elements 
are shown together with the technology used. 


The access times and operating powers given in Table 1.1 
refer to the memory parts themselves. However, when used 
in systems, actual access and cycle times are usually some- 
what greater and may be affected significantly by memory 
size. In addition, system cost per bit may be a significant 
function of memory size. Figure 1.1 gives relative cost and 
access time for systems over a range of memory sizes for 
each of the products described above. Additional compo- 
nents for memory systems are assumed to be drawn from 
TTL logic families. 


From the charts of Figure 1.1, the relative speed and cost of 
the different approaches can be determined. If speed is un- 
important and a relatively large memory must be built, then 
the 1103 dynamic MOS RAM is the obvious choice. How- 
ever, when the- memory needed is small, one of the other 
components may be more economical, because they require 
less peripheral and control circuitry than does the dynamic 
MOS. In other cases, very high speed may be required so 
that one of the bipolar products must be used. 


Table 1.1. Organization and Operation of Fixed Address RAM 


—_—————  $  — ————————— er _M—GC ONw«,»>TCY 
Part No, | Technology 
3101 Bipolar 16 x4 
3101A Bipolar 16 x4 
3106/7 Bipolar 256 x1 
3106A/7A | Bipolar 256 x 1 
1101A Static MOS - 256 x 1 


p-channel 
2102 Static MOS - 1024 x1 
n-channel 
1103 Dynamic MOS - 1024 x1 
p-channel 
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Access/Cycle Operating Power/Bit | Standby Power/Bit 
60/60 ns 
35/35ns 
80/80 ns 
60/60 ns 
1500/1500 ns 


1000/1000 ns 


300/600 ns 


intel” RAMs 


Cost vs. Size 


CENTS/BIT 


0 10 20 30 40 #50 #60 #70 #+}80 90 100 


SIZE (THOUSANDS OF BITS) 


Access Time vs. Size 


800 


400 


SIZE — BITS 


Figure 1.1, Memory System Characteristic vs. Size 


Having chosen a suitable memory part for the basic building 


block of the system, the designer must next organize the 
system. The design of the static MOS memories is quite 


similar to that of bipolar memories. However, dynamic 
MOS random access memories may require a number of 
additional considerations. 


DESIGNS USING BIPOLAR and 


STATIC MOS CIRCUITS 


Fixed address RAMs are usually organized as [some num- 
ber] N words of [some number] M bits each. N may be 
any value from 8 to 16 for scratch pad memories to several 
million for very large computer memories. M usually falls 
in the range of 6 to several hundred. Because most com- 
- puters use binary arithmetic, the address which selects one 
of the N words is usually a binary value of some k bits, 
with N designed to be equal to a power of two, that is, 2”. 


To some extent, these organizations are the result of ferrite 
core economics, for core memory costs are usually lowest 
when N = M4. Semiconductor memory offers greater free- 
dom of organization, with the organization limits being de- 
termined by the size of the basic building blocks, i.e., mem- 
ory components. Not all words of a semiconductor memory 
need have the same number of bits, nor is it necessary for 
N to always be exactly some power of two. For example, 
when using 1024 x 1 memory components such as the 
1103 or 2102, a memory of 3036 or 5120 words could 
easily be constructed with a cost per bit about the same 
as for a 4096 word memory. 


In general, each semiconductor memory chip realizes some 


2? words of q bits each. As long as N is a multiple of 2? 
and M a multiple of q, an N word by M bit memory can be 
realized without wasting bits. 


To build an N word by M bit memory from 2P word x q bit 
components, the components are effectively laid out in a 
two dimensional array. The array must include a total of 
N/2? rows and M/q columns. 


All of the semiconductor memory components listed in 
Table 1.1 are designed to make the connection in arrays as 
easy as possible, and with as few additional components as 
possible. 


All permit several devices to have inputs and outputs con- 
nected to common input and output busses. One chip at 
a time is enabled for writing or reading (placing its data on 
an output bus) by means of a chip select signal. 


To wire an array of 2P word by q bit chips as an N=2¥ word 
memory, the array will contain 2k-P rows of M/q circuits 
(chips or packages) each. 


Of the k address lines to the memory, p will be common 
to all chips. The remaining k-p address lines will be decoded 
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M INPUT LEADS 


("% GROUPS OF q LEADS EACH) 


WRITE ENABLE 
(COMMON TO ALL 
DEVICES IN ARRAY) 
BUFFERED AS 
NECESSARY 


K ADDRESS 
LINES IN : P ADDRESS LINES 


(COMMON TO ALL 
PACKAGES IN ARRAY) 


(ADDRESS LINES BUFFERED 
AS REQUIRED) 


\/ \/ INPUT BUFFERS 
(AS REQUIRED) 


BUFFERS 
(AS REQUIRED) 


M OUTPUT LEADS 


(3 GROUPS OF q LEADS EACH) 


Figure 1.2, Organization of 2k Word by m Bit Memory from 2P Word by q Bit Memory 


to generate 2®-P row select signals. One of these two select 
signals drives all of the chip select leads in the row. 


Figure 1.2 shows this basic organization. 


These general rules apply to most of the memory products 
listed in Table 1.1. Because the remaining considerations, 
buffer circuitry, etc. are quite different for dynamic MOS 
system than for bipolar and static MOS, the remainder of 
this section has been divided into two segments. 


In general, the rules of wiring the array of Figure 1.2 are 
as follows: 


1. All corresponding power supply leads are made common 
throughout the array. In general, a two dimensional grid 
should be used when possible. (See Figure 1.3.) 
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2. The write enable signal is made common throughout the 
array. 


3. All corresponding addresses are made common through- 
out the array. 


4. Corresponding data input and data output leads are made 
common within array columns. 


5. Corresponding chip select leads are made common within 
each row. The function of the chip select leads is to permit 
the array interconnection. When conditions for chip (i.e., 
row) selection are not met, no input signal can affect the 
contents of that row, nor does any unselected chip affect 
the signals on the data output line to which it is attached. 
Thus the chip select leads permit output leads to be OR- 


tied and eliminates the necessity to decode the write pulse 


signals, 


’ The array of Figure 1.2 can be conveniently laid out using . 


two printed circuit layers. In Figure 1.2, buffer circuits on 
address and data input and outputs have been indicated. 
The degree to which such buffer circuits are necessary is a 
function of the type of circuit used, the loading that can 
be tolerated by the external circuits, and the degradation in 
access and cycle time which is acceptable. ~ 

- Figure 1.3a shows the pin connections for three of the parts 


a. Pin Connections 


AODRESS INPUT A,{_J1 
CHIP SELECT CS(L]2 15f JA, ADORESS INPUT 


WRITE ENABLE WE (J3 147) a, ADDRESS INPUT 


DATA INPUT 0,44 
DATA OUTPUT 0, 
DATA INPUT 0, 


DATA OUTPUT 0, 


13 {J A, ADDRESS INPUT 
12 70, DATA INPUT 
11 {J 0, oata ouTPUT 


10 JO, DATA INPUT 


9 [J0, DATA OUTPUT 


b. Interconnections for Memory Arrays 


SELECT Al A2 A3 14 04 13 0 
As w tt ol 12 02 


3101 


mentioned in table 1.1 and Figure 1.3b shows typical lay- 
outs for each of the memory parts included in this category, 
with the orientation of all layouts chosen so that chip selects 
run horizontally in rows and data leads run vertically in col- 
umns. The remaining leads are run in the most convenient 
direction. All of these layouts make use of leads running 
between the pins of the dual in line package. To prevent 
solder shorts, these lines are run on the component side of 
the board. In Figure 1.3b the lines on the soldered side of 
the board are shown dotted. . 


ADDRESS INPUT 16 [J Vcc SUPPLY VOLTAGE 
ADDRESS INPUT 159-JA, ADDRESS INPUT | 
CHIPSELECT ¢S 149] A, ADDRESSINPUT 
DATA OUT CHIPSELECT CS. 13 JD, DATA INPUT 
DATA IN CHIPSELECT 8 12%] we WRITE ENABLE 
11) A, ADDRESS INPUT 
10] A, ADDRESS INPUT 


GROUND GNOC__Js 9 JA, ADORESS INPUT 


Figure 1.3, Printed Circuit Layout of Memory Arrays 
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A. BUFFER CIRCUITS 


A.1 TTL to P-channel MOS 


As an example of interfacing TTL to p-channel static MOS 
circuitry, consider the input requirements for the 1101A 
memory, which is realized with p-channel (low-threshold 
voltage) silicon gate MOS technology. 


1101A inputs are essentially purely capacitive, so that when 
driving the 1101A memory arrays, the primary considera- 
tions involve: © the effects of significant capacitive loads 
on address and data line buffers, and 0 guaranteeing proper 
voltage levels on these lines. 


Because 1101A memory arrays are usually relatively slow, 
the speed degradation associated with capacitive loads are 
usually not so important as the effects of reflecting large 
capacitive loads back into the driving capacity. 


Figure 1.4 shows the nature of the TTL to 1101A static 
MOS interface. In Figure 1.4, Q; conducts when the TTL 
gate output is sufficiently negative with respect to Vcc (at 
least 4.5V below Voc). The ratio of geometries between 
Q and Q¢ is chosen so that when Q, conducts, the internal 
MOS signal line is brought within a volt or two of Vcc. 
When the TTL gate output is high, Q; does not conduct, 
and the internal MOS line is drawn toward the —V supply 
by Qo. This high level must be no lower than Vcc —2V. 
Resistor R should be added to guarantee proper operating 
levels, as TTL outputs without the resistor are only guaran- 
teed to reach Voc —2.35V (7400 series). Note that the 
input signal voltage is referenced to Voc. It is, therefore, 
important that Voc be adequately bypassed to the TTL 
ground line. Voc is normally the same supply as used for 
the TTL logic. When driving p-channel MOS, it is undesir- 
able to allow this voltage to fall too low, for noise immunity 
may suffer. Because the MOS input draws no DC current 
for normal bias, a very large number of devices may be 
driven by a single TTL gate, although the capacitive loading 
in large arrays may cause some speed degradation. 


ZENER 


GATE 
PROTECTION 
DIODE 


TTL GATE 


INTERNAL 
MOS 
SIGNAL 


Q2 


Figure 1.4, TTL to p-channel MOS Interface (1101A) 
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The output of the p-channel MOS circuit can be designed 
to drivea TTL or low power TTL (LPTTL) gate. The 1101A 
can drive at least one TTL gate over the full temperature 
range of 0 to 70°C. Figure 1.5 shows the nature of the p- 
channel MOS to TTL interface. Parts which are disabled 
(not enabled by chip select) have neither Q; nor Qo con- 
ducting. For a logic “1”, Qy conducts, providing a 1 level 
very nearly at Vcc. However, for a logic “0”, Q2 conducts. 
For the 1101A, Qo sinks at least 2.0mA when the output 
is at .45V above ground, thus exceeding the minimum TTL 
sinking requirement of 1.6mA. Typically, the excess sinking 
current capability of the 1101A causes the TTL input clamp 
diode Dj, to be forward biased (at around —.7V with respect 
to ground) so that some TTL substrate current flows. The 
maximum current from the 1101A is 13mA at —1V. 


OUTPUT 
DISABLE 


DATA 


Figure 1.5, P-channel MOS to TTL Interface 


A.2 TTL to N-channel MOS (2102) 


The 2102 is an n-channel 1024 word by 1 bit memory, It 
is ideally suited for applications where high performance, 
low cost, large bit storage, and simple interfacing are im- 
portant design objectives. The 2102 requires only a single 
+5V power supply. The 2102 has operating conditions 
which are almost identical to TTL, so that interfacing the 
2102 to TTL is far easier than for p-channel MOS devices. 
Figure 1.6 shows the nature of the input and output 
interfaces. 


Within the n-channel circuit, the signal voltage levels are 
very much like TTL levels. When the TTL output is high, 
Q 1 conducts, producing a low internal level. When the TTL 
output is low, Q, is off, and Qg produces a high internal 
level. Input requirements are for high input levels of 2.2V 
or greater and low input levels of .65V or less. Asa result, 
noise margins are somewhat less than those of TTL circuitry. 


n-CHANNEL MOS 


Figure 1.6. N-channel TTL Interface 


However, the slower n-channel circuits do not respond as 
rapidly to noise as bipolar TTL circuits. This slower res- 
ponse somewhat compensates for the reduced noise margins. 


The output circuit is capable of driving one 1.66mA TTL 

load with .3mA in reserve. However, when OR-tying 
several devices in an array, this .3mA is needed to compen- 
sate for output leakage currents, These output leakages 
(100uA maximum) are equivalent to conduction currents 
in output transistor Q4 of disabled devices. Asa result, no 
more than 4 devices (2102s) should be OR-tied when driving 
a standard TTL load. For larger arrays, OR-tie capability 
can be increased by an LPTTL buffer stage, or by organiz- 
ing the memory in 4k word modules each with its own out- 
put buffer. These output buffers might be open-collector 
TTL gates enabling data onto a bus or. might be realized 
using multiplexers. The TTL circuits may be operated from 
the power supply provided for the 2102s. 


A.3 Bipolar Buffer Considerations 


Once a bipolar memory has been laid out using the tech- 
niques shown in Figures 1.2 and 1.3, the signals at the 
periphery must be properly dealt with. 


The 3101, 3101A, 3106, 3106A, and 3107 memories are 
all designed using a Schottky technology, and except for the 
3101, have input loads less than 1/6 that of standard TTL. 


In spite of these small loading factors, buffering is usually 
necessary because the large number of inputs which must 
be driven represent a significant capacitive load. Because 
of TTL loading restrictions, several levels of buffering may 
be necessary. 


The bipolar components listed have either open collector 
outputs (3101, 3101A, 3107, 3107A) or three-state output 
(3106, 3106A). 


Those components with open collector output require a 
pull-up resistor or a resistor terminator. When the output 
is going from “1” to “0”, discharge current for load capa- 
citance is provided by the output transistor in the memory 


RAMs 


device. When the output goes from “0” to “1”, the capae- 


‘itor charging current is provided by a pull-up resistor. A 


resistive terminator circuit may be used to reduce the time 
constant of the output circuit, yet still not exceed the cur- 
rent sinking capacity of the output devices, Such a circuit 
is shown in Figure 1.7. This terminator reduces the time 
for the one to zero transition, while slightly degrading the 
zero to one transition. For such memories as the 3101, 
a net reduction in worst case access time results. 


OUTPUT BUS 


Figure 1.7. Resistor Terminator for Bipolar Outputs 


B. ACCESS TIME CONSIDERATIONS 


To estimate the access time of a system using these semi- 
conductor memory elements, the worst case path for the 
addressing mode used must be found. Figure 1.8 shows 
symbolically some of the paths in a typical memory system. 


To estimate the access time, the longest path through the 
system is found (taking into consideration the differences 
in delay for chip select and address inputs, and including 
any delays due to wire, excessive capacitive loading, etc.). 
Note that, as a general rule, every increase in memory 
size by a factor of 8 to 10 requires at least one more level 
of buffering on all inputs and requires one or more levels 
of multiplexing on outputs. Therefore, access time in TTL- 
buffered systems tends to increase by 20 to 30nsec. for each 
order of magnitude increase in memory size, until the physi- 
cal size of the memory also contributes significant delay. 


All of the parts described allow an OR-tie connection of 
output leads. However, each additional device connected 
to the output adds capacitive loading to the output line. 
When very large memory arrays are used, this capacitive 
loading may seriously increase access time due to the time 
required for the output line to be charged or discharged. 
In general, the access time is incurred by an amount approxi- 
mately equal to the time constant of the data line. 


Three state outputs are provided on the 3106 and 3106A 
as opposed to open collector outputs on the 3107 and 
3107A. Some designers prefer the three-state output be- 
cause of its increased capacitance driving capability and 
ability to handle more TTL loads on the bus. (Total 
TTL load driving capability is reduced by the pull-up re- 
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CHIP SELECT PATH 


DECODE 
BUFFERS, 


‘put ADDRESS porn 
ADDRESS | 
| | 


ADDRESS BUFFERS 


BUFFERS, 


INVERTERS 


MEMORY 
ARRAY 


DECODE 


OUTPUT 
ENABLE 


MEMORY 
DATA LINE 
TERMINATOR 
(3101, 3107) 


BUS TERMINATOR 


Figure 1.8. Possible Delay Paths in 1101A, 3101 and 3106/7 Systems 


sistor loading.) Other designers object to the increased sys- 
tem noise and power supply spiking which may take place 
if the switching of the various three-state devices on a bus 
line are not carefully synchronized. 


C. CHIP SELECT DECODING 


Each of the buffered arrays described above presents a num- 
ber of address leads, together with one or more chip selects 
per row. The address leads perform selection of a single 
word from each row while chip select leads act to select 
which row will deliver data to (or accept data from) the out- 
side of the array. - 


To select a single word location in an array of 2? words 
requires p binary address bits. If each memory device con- 
tains 2” words, 2P"" rows must be used in the memory 
device array. To select a word, n of the address leads are 
sent to the array as common addresses and p-n are decoded 
to provide row selection. The chip select signals are used 
in conjunction with row select decoding. 


For the 1101A and 3101A memory devices, which have a 
single chip select input, the additional address bits are 
decoded such that the chip select for the selected row is 
low, and all others remain high. For memory parts with 
multiple chip selects such as the 3106/7, the multiple chip 
select inputs may be used to decode some of the additional 
address bits. Examples of chip select decoding are shown in 
Figure 1.9. For small arrays, there may be sufficient chip 
select inputs to decode all of the additional address bits. 


Figure 1.9 shows how the 3205 one of eight decoder can be 
used to decode 3 to 6 additional address input leads and 
generate 8 to 64 chip select signals. These chip select 
signals are compatible with 3101, 3101A, 3106, 3106A, 
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3107, 3107A, and with the 1101A (if pull-up resistors are 
added to the outputs of the 3205). 


D. POWER CONSIDERATIONS 


The random access memory products discussed above all 
dissipate power at significantly greater levels per package 
than most LSI and MSI components. In addition, the 
array wiring shown provides much denser packing than is 
commonly achieved with random logic. As a result, the 
power dissipation per unit area of circuit board can approach 
an order of magnitude greater than ordinary TTL logic. 
The system designer must insure an adequate flow of cooling 
air in any design which uses more than a few packages. 


Power distribution is also important. Although most static 
MOS and bipolar devices draw relatively constant current 
from the power supplier, MOS output circuits, particularly 
those of p-channel MOS, can contribute significant tran- 
sients to power supplies. Bipolar devices with relatively 
low pull-up resistors and those with three state outputs 
can also introduce significant transients. To insure proper 
operation of both memory and the surrounding logic cir- 
cuitry, power supplies should be adequately bypassed. Cer- 
amic capacitors in the .001 to .05ufd range are recom- 
mended, with one capacitor being used for every one to 10 
parts. Bypassing requirements are usually less stringent for 
multi-layer boards using power and ground planes. 


Two of the static memory components described here, the 
2102 and the 1101A, offer reduced standby power modes 
of operation. These parts have been designed such that the 
memory cells can operate on reduced voltages without loss 
of data. In addition, the 1101A provides separate power 
leads for the memory cells and the peripheral decoders. 


To enter standby mode, the chips should all be deselected, 
and in the read mode, before the voltages are lowered. For 
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b. 16 Chip Selects From Two 3205s 


Figure 1.9. Chip Select Decoding Using The 3205 


the 2102, all addresses should be held low to achieve the 
greatest power savings. Power supplies should switch to the 
voltages (standby or operating voltage) without ‘“‘glitches”’ 
or overshoot. At least one microsecond cycle time should 
be allowed between the last access or write cycle before 
power is lowered, and one microsecond cycle time should be 
allowed after normal voltages are restored before operating 
the memory. Table 1.2 shows the possible power saving. 


Table 1,2, ‘Low Power Standby — Static MOS Memories 


720mW 
370mW 


Ek. PAGING TECHNIQUES FOR ACCESS TIME. _ 
REDUCTION feces aa: 
Semiconductor memory systems may exhibit different ac... . 
cess times for different address changes. For example, when 
only the multiplexer control bits are changed for multi- — 
plexer connected modules, the access time for new data 
fetched from within an array is equal to the multiplexer 
switching time. Similarly, the chip select access for 1101As 
is much faster than the address-change access. In the 
3106/7, access for addresses Ag to Ag is typically much 
faster than that of addresses A 4 through A 7. : 


It is often possible to organize a memory system such that 
some address bit changes result in shorter access times than 
others. Simple circuits may be added which compare the 
new address with the old to signal whether fast or slow 
access will be obtained. 
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DYNAMIC MOS MEMORY--THE 1103 


The Intel 1103, a 1024-bit, dyanmic, silicon-gate, MOS 
“random access memory chip, has become one of the most 
important semiconductor components ever produced. The 
. Memory system designer should consider this component 
for.new random access memory system designs, as it offers 

‘significant advantages over cores in cost, system flexibility, 

‘and performance. Fourteen out of eighteen computer 
_ mainframe manufacturers throughout the world are now 
using the 1103 memory. With more than a dozen semi- 
conductor manufacturers having announced their intentions 
to second source the 1103, this product is assured a position 
as an industry standard. 


Because the 1103 is such an important component, and be- 
“cause dynamic MOS circuits require a number of unique 
‘considerations, this special section has been devoted ex- 
clusively to the 1103. This section is divided into articles 
on the principles of operation of the 1103, basic system 
design using the 1103, special considerations such as mini- 
mizing power consumption in 1103 memories, and examples 
. of 1103 system design. 


MEMORY MATRIX: 


READ/ 
WRITE 32 ROWS 
AMPLIFIERS 32 COLUMNS 

(1024 BITS) 


A. OPERATION OF THE 1103 


The Intel 1103 is a 1024-bit random-access, fully decoded, 
read-write memory utilizing the cell shown in Figure A-7b. 
shown in the Appendix, for storage. It is mounted in an 18 
lead dual in-line package. Figure 1.10a shows a block dia- 
gram of the part, while Figure 1.10b shows the pin connec- 
tions. A more complete diagram is shown in Figure 1.11. 


The memory is organized as 32 rows of 32 cells each. Five 
address lines, Ay through A,, are decoded to select one row 
of cells. When accessed, the contents of this row are trans- 
ferred to a row of 32 refresh amplifiers. In the course of a 
memory cycle, whether read or write, the data is regenerat- 
ed and written back into the selected row of cells. Address 
bits A; through A g are decoded to select one refresh ampli- 
fier for communication with the data input and output ter- 
minals. Data output is sensed as a current. Activation of 
the “write” clock effectively disconnects the refresh ampli- 
fier outputs from the write data lines and permits the sig- 
nal on the data input line to over ride the signal at the out- 
put of the selected refresh amplifier. 


READ/WRITE 
Vss 
CENABLE 

A 


4 


PRECHARGE DATA OUT 


REFRESH AMPLIFIERS, 
READ/WRITE COLUMN A A 
GATING 9 8 


1 OF 32 
COLUMN SELECTOR 


A, Ag A; Ag Ag 


Vg Q————_——a 

Vop OO 
PRECHARGE O———____»> Bata is 
CENABLE O——————___»> 


READ/WRITE O————___> 


a. Block Diagram b. Pin Connections 


Figure 1.10. Block Diagram and External Connections of 1103 
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DATA P = PRECHARGE 
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CE = CENABLE 
W = READ/WRITE 

CE LOGIC 0 = HIGH VOLTAGE (H) 
LOGIC 1 = LOW VOLTAGE (L) 
Vag = SUBSTRATE VOLTAGE 


SWVd 


Figure 1.11, 1103 Schematic Diagram 
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' Figure 1.12 shows the basic timing of the 1103 memory 
cycle. The timing values specified for each of the input sig- 
nals (shown in Table 1.3) are those guaranteed to permit 
operation over the specified operating temperature range. * 


The cycle timing is established by the three clock signals: 
precharge, cenable, and write. Initially (prior to execution 
of a memory cycle) all clocks are at their high state, at a 
voltage approximately equal to Vgg- 


To begin a cycle, precharge is first brought low, to approxi- 
mately Vpp potential. Referring to Figures 1.11 and 1.12, 
this operation activates the row and column decoders, and 
also charges all read and write data lines negatively (i.e., to 
the equivalent of a logic “high” state for the p-channel MOS). 
In the discussion which follows, clocks, etc. are considered 

n” at Vpp level, and “off” at Vog level. “High and low” 
refer to the magnitude of charge with respect to the MOS 
substrate. 


-*While individual units may be operable typically at much 

high speeds than the guaranteed values, particularly at room 
temperature, the designer of a production system should ob- 
serve all of the limits specified in the data sheet. 


ADDRESS CANCHANGE. SOOO CAN CHANGE 


The decoder circuitry is somewhat faster than the line 
charging circuitry, so addresses need not be stable until 
somewhat after precharge is applied. The system designer 
may take advantage of this characteristic in several ways; 
for example, to utilize a less expensive (slower) address 
driver. Of course, addresses may be provided before pre- 
charge is turned on. 


After precharge and addresses have been present long enough 
for the data lines to charge and decoders to stabilize, the 
cenable clock may be turned on — i.e., dropped to its low 
state. At this time, the desired read-select line is activated 
and the read-data line charging circuits are disabled. 


These data lines begin to discharge selectively, with the sig- 
nals on them approaching values corresponding to the com- 
plements of the data stored in the selected row of cells. 


As the read-data lines selectively discharge, the precharge 
signal is turned off, i.e., raised high to Vog. This action 
removes the charging signal on the write-data lines, and 
closes a path so that the write-data lines may be selectively 
discharged. The write-select line corresponding to the se- 
lected read-select line is also activated, so that restoration 
of the cell contents occurs. The signal level on the write- 
data line isa function of the overlap time between precharge 
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Figure 1,12. 1103 Timing — Minimum Read/Write Cycle and Supply Current Variation 
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and cenable. If this overlap is too short, the read-data lines 
will not have discharged sufficiently when the discharge path 
from the refresh amplifiers to the write-data lines is closed. 
As a result, high (negative) levels written into the cells may 
be reduced. 


If, however, the overlap time is excessive, weak lows within 
the cells may result in some discharge of the read lines 
before closure of the write path back, so that cells with 
weak ‘lows have higher levels (ever weaker lows) written 
back into them, eventually resulting in lows changing to 
highs. This problem is somewhat aggravated by the small 
but unavoidable capacitive coupling between the data and 
select lines and the cell storage capacitor. 


When cenable is turned on, a path for current to flow from 
Vgg to output exists, for one column decoder is enabled 
and all write-data lines have been charged high (negatively). 
If the selected cell (the cell at the intersection of the select- 
ed column and selected row) contains a low, the write-data 
line will be discharged after precharge is removed and the 


output current will be cut off. If, however, the selected 
cell has been negatively charged, the output current will 
continue to flow. 


Cenable must remain present (after precharge turn off) for 
sufficient time to allow the contents of the selected row of 
cells to be refreshed. Even after cenable is turned off 
(raised to Vgg) the addresses should remain for a few (t,, = 
20) nsec to allow completion of internal operations. ' Pre- 

charge should not be applied again until cenable has re- 
mained off for at least t, = 85nsec. 


If the memory cycle must include a write operation (with 
or without a read operation) all sequences proceed as above. 
However, before cenable is removed, but after a sufficient 
time has been allotted for stabilization of the write data 
lines, the write line may be activated. As a result, the réad 
data lines are discharged, effectively disconnecting the re- 
fresh amplifiers from the write data line. Thus, a direct 
path from the data input to the selected tell is established. 


Table 1.3. Timing Specifications for 1103 
AC CHARACTERISTICS T,=0°C to 70°C, Vos = 16 + 5%, (Vgg —Vsg ) = 3.0V to 4.0V, Vop = OV 


READ, WRITE, AND READ/WRITE CYCLE 


TIME BETWEEN REFRESH 

ADDRESS TO CENABLE SET UP TIME 
CENABLE TO ADDRESS HOLD TIME 
PRECHARGE TO CENABLE DELAY 
PRECHARGE & CENABLE OVERLAP, LOW 
CENABLE TO PRECHARGE DELAY 
PRECHARGE & CENABLE OVERLAP, HIGH 


READ CYCLE 
PRECHARGE TO END OF CENABLE 


END OF PRECHARGE TO 
OUTPUT DELAY 


tice')| ADDRESS TO OUTPUT ACCESS 


soe PRECHARGE TO OUTPUT ACCESS 


WRITE OR READ/WRITE CYCLE 


WRITE CYCLE 

READ/WRITE CYCLE 

PRECHARGE TO READ/WRITE DELAY 
READ/WRITE PULSE WIDTH 
READ/WRITE SET UP TIME 

DATA SET UP TIME 

DATA HOLD TIME 


END OF PRECHARGE TO 
OUTPUT DELAY 


TIME TO NEXT PRECHARGE 


CONDITIONS 


t = 20 ns 


tacmin + tovimin L Croan = 100 pF 
+ teomax + 2 tr Rioap = 1002 
tecmin + fovimin Vier = se my 


+ tromax + 2 tr 


tr = 20 ns 


Croan = 100 pF 
Rioao => 1002 
Veer = 40 mV 


Note 1: ,These times will degrade by 40 ns (worst case) if the maximum values for Vy, (for precharge, cenable and read/write inputs) go to Vgg— 


14.2V @ O°C and Vgg—14.5V @ 70°C as defined on page 2. 
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A signal on this input will then overwrite the contents of 
the cell. If the read/write line remains “on” (Vpp potential) 
after cenable goes off (tew > Onsec), those conditions 
caused by excessive overlap time are aggravated and im- 
proper operation of the device may result. 


The timing specifications for operating the 1103 are shown 
in Table 1.3. All the time values listed, except tpg ,t ACC1; 
and t accg are generated by the system in which the 1103 
is installed. These times should be kept within their stated 
limits if proper operation over the full temperature range is 
to be achieved. In addition to these stated limits, the pre- 
charge duty cycle should be held below 40% to keep power 
dissipation at an acceptable value. 


The time designated tpo which refers to the time delay ob- 
served between the turn-off of precharge and the availability 
of data at the 1103 output terminals, is a characteristic of 
the part. When operated within the proper voltage and 
timing limits, this delay is guaranteed not to exceed the 
stated value of 120 nsec. 


The two access times, taccy and taccg represent a com- 
bination of system operating parameters and characteristics 
of the 1103. Thus, the stated “minimum” values of 300 
and 310nsec represent the shortest access times which can 
be guaranteed when parts are operated within the limits spe- 
cified and with rise and fall times of 20nsec. As will be dis- 
cussed later, system access times will exceed these values be- 
cause of the additional delays and tolerance introduced by 
the rest of the system. 


B. CHIP SELECT 


In operation, the cenable clock also acts as a chip select. 
That is, precharge and write signals may be applied at their 
normal times in the cycle, but if cenable is not applied, the 
part will neither deliver current to the output terminal nor 
will the contents of any cell be altered. However, no re- 
freshing of memory content takes place during such a 
cycle. 


C. CLOCK SIGNAL AMPLITUDES 


To guarantee operation of the 1103 over the full tempera- 
ture range, the clock amplitudes (precharge, cenable and 
write) must be maintained at the levels defined below in 
Table 1.4. 


The reduced performance specification given in Table 1.4b 
allows simpler clock-driver designs or permits wider power 
supply variations for a given clock driver design, but re- 
quires adding 40ns to tpc and tac. Asa result, access and 
cycle times are increased by at least 40 ns, when this less 
stringent specification is used. 
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Table 1.4 
A. Clock Signal Amplitudes — Full Performance 


Signal Minimum 
Vix (high) @ 0°C Vog—1.0  Vog + 1.0 
Vizz (high) @ 70°C = Vgg—0.7 Vg + 1.0 
Viz, (ow)@0°C == Vgg —17 
Viz, (low) @ 70°C = Vgg — 17 


B. Clock Signal Amplitudes — Reduced Performance 


Signal Maximum 
Viz (high) @ 0°C Vsg 1.0 Vog + 1.0 
Vip (high)@ 70°C = =-Vgg —0.7 = Vg +:1.0 
Vit (low) @0°C Vgg — 14.2 
Viz, (low) @ 70°C Vgg — 14.5 


NOTE: The maximum value for Vy and the minimum 
value of Vy are linear functions of temperature over the 
range 0 to 70°C and can be calculated using a straight line 
relationship. 


D. ARRAY CONNECTION OF 1103s 


To design a memory system with 1103s, a number of factors 
must be considered. The system must be capable of genera- 
ting the timing for the various clocks, etc. and distributing 
these signals at the proper voltage levels to the 1103s. Suit- 
able power must be distributed to the 1103s and peripheral 
circuitry, and means must be provided to prevent catas- 
trophic damage to the memory array in the event of peri- 
pheral circuit failure. Refresh control circuits may also have 
to be provided to guarantee that all memory cells retain 
data. 


The nature of the 1103 permits a number of units to be 
wired in a rectangular array much like that used for bipolar 
and static MOS circuits. In this way, several thousand 
words of some selected number of bits can be realized with 
a single set of peripheral interface circuits. Figure 1.13 
shows how 1103s may be so wired. When making access to 
an array such as that of Figure 1.13, the cenable clock is 
used to select a row of 1103s, while the 10 address lines 
select the word from within the selected row. In most 
cases, the system designer will probably find it most con- 
venient to generate timing signals for the 1103 memory 
system using one of the standard high speed integrated 
circuit logic families such as TTL or ECL. To operate the 
1103, the generated signals must be converted from the 
logic family levels to MOS levels. Similarly, the outputs 
from the 1103 must be converted back to suitable logic 
levels. In Figure 1.13, the small blocks labelled L repre- 
sent level shifters from the ECL or TTL levels to MOS 
levels (e.g. the Intel 3207A), while the blocks labelled S are 
sense amplifiers (e.g., the Intel 3208A or 3408A) which 
convert the output current from the 1103s back to TTL 
or ECL levels. 

The power supply connections are not shown in Figure 1.13. 
In practice, Vpp, Vgg, and Vpp connections are each made 
common throughout the array. It is especially important to 
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provide adequate distribution for these supplies as will be 
described in the section on printed circuit layout. 


D.1 Level Shift Circuits for TTL Interface 


The function of the level shift circuits is to convert from the 
TTL or ECL levels used by the system to MOS levels. To 
avoid degrading the performance of the system, fast rise and 
fall times must be maintained into the load represented by 
the array of 1103s. The level shifter must also be capable 
of holding the shifted level within rated tolerances for the 
signal line being driven. The levels for clock and address 
lines are referenced with respect to Vos. However, in actual 
practice, most level shifters use ground or the Vpp supply 
as a reference value for the low level. Asa result, the offset 
from ground which is characteristic of the level shifter, 
when combined with normal power supply variation, may 
result in an insufficiently negative level for some of the in- 
put signals. To give the system designer some freedom. in 
his choice of drivers, the 1103 is characterized for two 
different levels of clock drive signals as outlined in Table 
1.4. In many cases, a driver with a higher offset voltage 
may have lower cost or better load drive capability than 
those with low offset voltages. The designer may choose 
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to sacrifice some speed to permit use of a net lower cost 
set of level shifters. Tables 1.4, 1.5, and 1.6 list some of the 
characteristics of the 1103 which are relevant to level 
shifter design. 


The signal lines from an 1103 memory array represent signi- 
ficant capacitive loads. For example, consider a 4096 
word, 16 bit per word memory array. If a connection equi- 
valent to that of Figure 1.13 is used, the array will consist 
of 4 rows of 16 devices each. The worst case capacitances 
‘due to the memory parts are then as follows: 


Each of 10 address lines 64 x 7pF = 488pF 
Each of 4 precharge lines 16 x 18pF = 288pF 
Each of 4 cenable lines 16 x 18pF = 288pF 
Each of 4 write lines 16 x 15 pF = 240pF 
Each of 16 data input lines 4x 5pF = 20pF 


In addition to these loads, the wiring capacitance of the 
printed board, etc., used to connect the array must also 
be considered. To estimate the average charging and dis- 
charging current I required to change the voltage by a value 
of AE on a capacitor of value C in a time At, the relation- 
ship I = CAE/At may be used. 


If C is in picofarads (pF), E in volts and t in ns, the units of 


Table 1.5. Input Level Requirements — 1103 Address and Data Lines 


SYMBOL 


INPUT LOAD CURRENT (ALL !NPUT PINS) 


see tae 
OUTPUT LEAKAGE CURRENT 


INPUT LOW VOLTAGE 
(ALL ADDRESS & DATA-IN LINES) 


Vit2 INPUT LOW VOLTAGE 
| (ALL ADDRESS & DATA-IN LINES) 


CONDITIONS 


Vit a-2)| INPUT LOW VOLTAGE (PRECHARGE 
CENABLE & READ/WRITE INPUTS) 


Vv call) INPUT LOW VOLTAGE (PRECHARGE 
CENABLE& READ/WRITE INPUTS) 


Vina!) | INPUT HIGH VOLTAGE 
(ALL INPUTS) 


Vina (1) | INPUT HIGH VOLTAGE 
(ALL INPUTS) 


Note 1: The maximum values for Vj, and the minimum values for Vj}; are linearly related to temperature between O°C and 70°C. Thus any value 
in between O°C and 70°C can be calculated by using a straight-line relationship. 

Note 2: The maximum values for Vit (for precharge, cenable & read/write) may be increased to Vgg—14.2 @ O0°C and Vsgsg—14.5 @ 70°C (same 
values as those specified for the address & data-in lines) with a 40 ns degradation (worst case) in tac. tpc. tro two: tawe. tacc1 and tacc?. 


Table 1.6. Capacitance Value of 1103 Signal Lines* 


ADDRESS CAPACITANCE 
PRECHARGE CAPACITANCE 
CENABLE CAPACITANCE 
READ/WRITE CAPACITANCE 
DATA INPUT CAPACITANCE 


DATA INPUT CAPACITANCE 


DATA OUTPUT CAPACITANCE 


Vin = Vss 7 


Vin = Vss 


Vin = Vss f = 1MHz 
All Unused 
Pins Are 


CENABLE = ov| AtA-C- 
Vic = Ver Ground 


CENABLE = Vss 
Vin = Vss 


Vour = OV 


Vin = Vss 


*This parameter is periodically sampled and is not 100% tested. They are measured at worst case operating conditions. 
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current I will be mA. Thus, for 20ns rise and fall times and 
16 volt transitions, each of the precharge and cenable lines 
will require a drive current which averages about 230mA 
over the transition. The peak charging current may be some- 
what higher than this value. Thus, the circuits used to drive 
1103 arrays must be capable of providing high peak currents. 


Although a capacitive load dissipates no power, when a 
capacitive load is charged, energy is drawn from the driver 
power supply. Some of this energy is dissipated by the 
driver while the remainder is stored in the capacitor. When 
the capacitor is discharged, its stored energy is dissipated in 
the driver. If a capacitor C is first charged to a voltage V 
and then discharged back to ground at.a rate of f times 
per second, the minimum average current I drawn from 
the driver power supply will be given by I = fCV. If fis in 
megahertz, C in picofarads, and V in volts, I will be in 
microamps. If the power supply is also of voltage V, the 
power dissipated by the driver due to the capacitive load 
is given by P = fCV2, where P is in microwatts if the same 
units listed above are used. Thus, worst case dissipation 
due to this factor for a precharge or cenable driver in the 
4k x 16 memory listed above would be about 120mW per 
driver at V = 16 volts, C = 288pF and f = 1.6MHz. This 
dissipation must be added to any other dissipation associat- 
ed with the driver. 


The capacitive loading associated with the array may also 
produce ringing if the leads from the drivers to the array 
have any significant inductance. Series damping resistors 
must usually be used. The choice of the damping resistor 
will usually be a function of the layout used and the num- 
ber of 1103s driven. In Figures 1.14 and 1.15 a nominal 
value of 10 ohms is shown in several of the circuits. In 
some cases a different value may be found to give better 
results. Figure 1.14 illustrates a number of level shifters 
for use with TTL logic. 


Table 1.7 lists some of the characteristics of these drivers. 
The level shifter of Figure 1.14a is used primarily for driving 
data input lines. In very small arrays, or arrays where de- 
graded performance can be tolerated, it may be used for 
driving address lines as well. A load resistor of 1k is shown, 
however, other values may be used. Of course, higher values 
reduce power dissipation but degrade speed. Lower load 
resistor values will speed up the level shifter, but increase 
power dissipation and may, if too low, exceed the current 
sinking capability of the driving gate. 


The capacitive load driving capability of the driver of Figure 


1,14a may be increased by adding a booster stage, as shown 
in Figure 1.14b. The complementary emitter follower con- 
siderably increases capacitive drive capability, as can be seen 
from Table 1.7. However, the emitter base drop of transis- 
tor Qo in Figure 1.14b raises the low output level to about 
+1 volt, If Vgg is allowed to fall to 15.2 volts, the output 
may fall out of the specified operating range at elevated 
temperatures and degrade the performance. 


When using a driver such as that of Figure 1.14b, the designer 
has several choices: 


1. He can restrict the allowable range (or tolerances) of the 
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MPS—U02 


102 


1/6 SN7406 


MPS—U52 


EN3014 


EN3014 


EN3014 


OuTPuT 0 0, OUTPUT 
DATA INPUT DATA INPUT 
ENABLE INPUT ENABLE INPUT 
ENABLE INPUT ENABLE INPUT 
DATA INPUT DATA INPUT 
ouTPuT 6 QUTPUT 


Figure 1.14. TTL to 1103 Interface Circuits 


Vgg supply to values which do not require speed degrada- 
tion. For example, a Vgg range of 15.7 to 16.8 allows 
precharge, etc. inputs not to exceed Vgg —14.7 and still 
maintains Vgg in the proper range. 


2. He can raise Vog by approximately 1V and compensate 
by biasing the Vpp return from the 1103 array at about 
+1V. (A diode in series with the Vpp return results in a 
nominal 0.7V Vpp return value, but requires some reduc- 
tion in allowable Vgg tolerances.) 
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Table 1.7. Typical Output Characteristics of Level Shift Circuits 


Low Voltage (V,, ) 
No Load 
Sinking 3 mA 


Rise time (t,) ') 


10 pF 25 ns 
50 pF 90 ns 
100 pF 
200 pF 
470 pF 


Fall time (t,) ‘?) 
10 pF 12 ns 
50 pF 20 ns 
100 pF — 
200 pF — 
470 pF — 


Driver (c) 


0.25V ~OV 0.04V 
+0.97V 0.14V 
High Voltage (V,,, ) 
No Load Vgg + 0.65V 
Source - 3mA os 
Sink x 0.5 mA 


Veg + 0.01V 
Vg — 0.05V 
Vg + 0.0111) 


Vsg + 0.01V 
Vsg — 0.12V 
Veg + 0.45V(1) 


10 ns 
15 ns 
20 ns 
25 ns 
50 ns 


6 ns 
9ns 
12 ns 
20 ns 
35 ns 


This voltage level is a function of transistor reverse gain. A diode clamp to Vgg is recommended. 


. 2. These values are measured between the 10% and 90% points. 


The driver of Figure 1.14b holds the output line very close 
to Vog when the output is high. However, if the layout 
permits positive transients to be capdcitively coupled to the 
driver line, the driver may not be capable of holding the 
line within tolerance. The reason for this is that the posi- 
tive transient clamping capability of the driver is a function 
of the reverse gain of transistor Qo. To insure that the high 


es output remains within limits, it may be necessary to adda 
-.@lamp diode (shown dotted in Figure 1.14b from the output 


line to Vgg)- 


' The driver shown in Figure 1.14c has somewhat reduced 
capacitive load driving capability from that of Figure 1.14b, 
but does not have the offset problem for low outputs. 
Again, positive transient clamping for high outputs is limited 
by the reverse gain of transistor Q 9 so a diode clamp to Vgg 
may be desirable. 


Figure 1-14d shows a monolithic integrated quad level shif- 
ter and driver, the 3207A, that is available from Intel. Each 
of the 4 drivers in the 16 pin dual in-line package can drive 
up to 200pF, with rise and fall times under 30 and 35ns. 
respectively. 


D.2 Level Shift Circuits for ECL Interface 


In Figure 1.15, a circuit for converting from ECL levels 
(ECL biased between ground and —5.2V) to MOS levels 
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(Vpp = GND, Vgg = +16) is shown together with the per- 
formance characteristics of the circuit. This circuit does 
not have significant negative offset, but may require a diode 
clamp to Vgg to reduce positive going, capacitively — 
coupled noise on the output line. A negative clamp to 
ground may also be desirable, as ringing can sometimes 
result in charging the output line to an excessively negative 
value. Excessively negative clock values may effectively re- 
duce the timing tolerances for t yy. 


ECL GATE 


—5.2V —5.2V —5.2V 


C, | DELAY TO FALL}! FALL TIME | DELAY TO RISE| RISE TIME | 
(10%) (10% to 90%) (10%) (10% to 90%) 


300 pF 16 nsec 22 nsec 20 nsec 30 nsec 


Figure 1.15. ECL to MOS Level Converter 


oa ® 
i 
D.3 Sense Amplifier Circuits 


The output of the 1103 is a current which is present fora 
logic “1’’ output and absent for a logic “O”. This current 
is usually converted to a logic level by a sensitive differen- 
tial amplifier which measures the voltage drop across a resis- 
tor of a few hundred ohms, Larger resistor values slow per- 
formance, while smaller values result in very small signals 
which are difficult to sense. Table 1.8 lists some of the rele- 
vant 1103 output characteristics. 


Figure 1.16 shows several different circuits for converting 
the 1103 output current back to standard logic levels. The 
first choice for a sense amplifier is the Intel 3208A or 3408A. 
The 3208A, shown in Figure 1.16a, is a hex sense amplifier 
specifically designed for use with the 1103. The 3408A, 
shown in Figure 1.16b, is similar to the 3208A, but in- 
cludes an internal hex latch so that the 1103 outputs may 
be stored and delivered to the data bus at a later time. 


In Figure 1.16a, a sense resistor R, is shown. In practice 
each sense input which is used would have a sense resistor 
R, to ground. Reference voltage for the 3208A (or 3408A) 
is generated by the voltage divider consisting of resistors 
R ,and Ro. 


The reference voltage of the 3208A and 3408A may be de- 
rived from either Vgg or Voc. The output current of the 
1103 is directly proportional to Vgg. To maintain close 
tracking of 1103 output current and sense amp. reference 
voltage, it is recommended that the reference voltage for 
the sense amp. be derived from Vgg. Due to the sensitivity 
of the sense amplifier input, the reference voltage must be 
well filtered and decoupled from noise and ripple. Any 


noise at the reference line may be falsely recognized as an 


input signal. 


The choice of sense resistor for use with the 3208A and 
3408A is influenced by two considerations, Larger values of 
resistor produce larger 1103 output voltages, but increase the 
time constant of the data output bus, and therefore increase 
the access time of the memory. Lower values result in high 
speed, but reduce the noise immunity of the system. If the 
resistor is too small, the minimum output “1” level of the 
1103 may not be large enough to exceed the “‘1” threshold 
of the sense amp. An optimum value of sense resistor and 
reference voltage may be defined as the values which result 
in the last added circuit delay given a desired amount of 
noise immunity. The optimum value may be computed 
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easily if the following assumptions are used (also see note 
below): 


1. The added delay due to an increase in data bus time is 
approximately equal to the increase in time constant. 


2. The added delay due to a change in current sensing 
level for the 1103 can be estimated at .1Ins/uA — that is, 
if the equivalent current level for sensing zeroes from the 
1103 is reduced by 10uA, the time delay is increased by 
1 nsec. 


Based on these assumptions, the optimum value of reference 
voltage is given by the equation: 


2x Vn + Vp 
Veet = Lmin 
10C 


where V,o is in millivolts and 


L nin = Minimum “1” level output current of the 1103 in uA. 
Vay = desired noise immunity in mV. 


Vp = maximum threshold variation (Vgz- Vgz). For 


3208A = 50 mV, for 3408A = 60mV. 
C = data bus capacitance in pF. 


If V,o¢ falls outside the range of 100 to 200 mV, then the 
value (100mV or 200mV) which is closest to V.e¢ should 
be used as V,or. 
Once V, ¢¢ has been chosen, the value for the sense resistor 
Rg is given by 
(1000) x (Ve¢ + Vay) 

Re= 

: Imin 
where Rg is in ohms, V,,¢ and Vy are in millivolts, and 
I minis in WA. 
Using these assumptions, the increase in access time AT, of 
the memory is given by 


Ta “toa t | Ipep- |—————— } k — + x 
BONER Bie 001 10 1000 


where t,, = sense amplifier delay in nsec, I.,- = defined 
measuring current for 1103 access time and Vier, Inin, Re 
Vand C are as defined above. 


As an example, consider a data bus capacitance of 100pF 
and desired noise immunity of 25mV. Then for an 1103 


Table 1.8. Output Characteristics of the 1103 


900 | uA | Ta = 25°C 


OUTPUT HIGH CURRENT 


POUTErt HIGH CURRENT 


| OUTPUT LOW CURRENT 


See Note 


800 LA | Ta = 70°C 


OUTPUT HIGH VOLTAGE 


Ta = 25°C A 


OUTPUT HIGH VOLTAGE 


Ta = 70°C, 


OUTPUT LOW VOLTAGE 


NOTE: The output current for the low state is the sum of 


See Nott 


the 1103 leakage and externally coupled noise. Voy, =Ior° Rs 
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with I .;, = 500KA and I ref™ 400uA, Thus the sense amp (32084) circuitry contributes an addi- 


can tional 64 ns to the access time of the system. 
Vref =500 / ————=150mV 
10 x 100 


175 
Rg= x 1000 = 3502 
500 


x 100 = 64nsee 


400 750 350 — 100 
—_—_ —— + —— 
10 35 


AT, =20+ |—— 
A 1000 


a. 3208A Block Diagram b. 3408A Block Diagram 
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c. Using SN75107/75108 e. Using a Discrete Transistor 


4.7K 


STROBE OR ENABLE 


SENSE LINE SENSE LINE 


OUTPUT 
DUMMY LINE —— 


(OPTIONAL) 


Ro 
200 STROBE OR 
ENABLE 


f. Conversion to ECL Levels 
d. Using Low Power TTL poe 
SENSE LINE 


ECL GATE 
SENSE LINE 


Figure 1.16. Sense Amplifiers 


Figure 1.16 also shows several-other circuits for converting 
the output current to logic levels. 


The circuit shown in Figure 1.16c uses one half of a 
SN5107 or SN75108 line receiver as a sense amplifier. 
The SN75108 requires a differential signal of at least 25mV 
for guaranteed operation. The high input side may require 
up to 75uA input current. The SN75107/8 may be used 
with balanced or unbalanced input. In Figure 1,.16c the 
SN75108 is shown with a balanced input circuit. Current 
from the 1103 array is sensed across resistor R,. 


The reference threshold is established by the combination of 
resistors R; and Ro, with Ro usually made equal to Rg. 
Using the minimum “1” level current of 500uA from the 
1103 and the 25mV and 75yuA characteristic of the 
SN75108, the reference level must be less than 
(.50 —.075)Rg — 25mV, where Rg is in ohms. However, 
the reference level must be in excess of 25mV for proper 
detection of zeroes, For this reason, Rg must be larger 
than 10092 when using the 75108. A value of 200 to 4002 
is more appropriate. 


In some systems, the effects of capacitively coupled noise 
on the data output line may be reduced by adding a dummy 
line in the array. This dummy line runs parallel to the sense 
line, but makes no connection, The noise signals then couple 
more or less equally into both lines and appear as common 
mode signals at the inputs to the SN75108. However, the 
output currents from the 1103s couple into only one line, 
and therefore contribute to a differential between the lines. 


The SN75108 may be also used as an unbalanced sense 
amplifier by grounding one of the differential inputs and 
adding a negative bias current to the sense line input. 
This current may be supplied by a suitable resistor returned 
to a negative supply voltage. 


The two versions of the part differ in that the SN75107 
has a'totem-pole TTL output, while the SN75108 has an 
open collector output. The SN75108 may be used to 
advantage in large systems, where several memory modules 
may share a common data bus. 


A comparator, such as the wA710 or MC1710, could be 
used as a sense amplifier in a similar fashion. The tighter 
differential voltage tolerance (6.5mV vs. 25mV) and lower 
bias current (40uA to 75uA) permit lower sense resistor 
values. However, the lack of strobe signal input and/or 
open collector output, and the more awkward power sup- 
plies required by the 710 make it a less desirable choice. 


Unlike the 3208A and 3408A, the SN75108 requires an 
additional negative supply for its operation. Where per- 
formance can be sacrificed, sensing can also be done with- 
out an extra supply by using circuits such as those of Figure 
1.16d and 1.16e. In Figure 1.16d, low power TTL is biased 
to allow direct sensing of the output current. In Figure 1.16e 
a discrete transistor circuit is used for sensing. Each of 
these circuits adds significantly to the access time of the 
system. 


Figure 1.16f shows one possible connection for conversion 
to ECL levels. In this circuit the 1103 output terminals 
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are biased about 1.5V negative with respect to Vpp. The 
relatively high impedance required adds delay to the system 
as described above. Another means for conversion to ECL 
levels is to use a discrete pnp transistor difference amplifier 
between the 1103 sense line and the ECL gate. 


E. SYSTEM TIMING 


Many potential users of the 1103 have experienced some 
confusion as to the meaning of “minimum” and “‘maximum”’ 
as used in relation to the 1103 specifications. For a sys- 
tem-generated time value, such as t pw or tgy;, the value, 
as measured at the terminals of any 1103, must fall within 
the stated limits. That is, for example, the time between 

turn-on of cenable and turn-off of precharge (t g1,) must 
fall within the range of 25 to 75nsec. Operation outside of 
this range may. cause inferior performance. Those time 
values which represent direct measurement of 1103 proper- 
ties, such as tpg (the time from precharge turnoff until data 
is available at the output) are specified as not to exceed the | 
stated maximum values. However, 1103 access and cycle 
times (taccp tacc2 two trwo tro) represent a combi- 
nation of system driving characteristics and 1103 character- 
istics. The minimum values stated represent the minimum 
system cycle time (or system access time) for which all 
parts are guaranteed to operate. These minimum values 
are achieved by operating with all drive signal timing values 
set to their minima and the access time strobing signal set 
at a value corresponding to the maximum tpg. Of course, 
by selecting devices, a cycle faster than the stated minimum 
might be achieved. In a practical system, skews in driver 
and logic circuitry will usually prevent operation with all 
time values set at their minima, and delays in sense ampli- 
fier and latches will add to access time. | 


In generating timing signals for an 1103 memory system, 
the skews (variations in the delays of gates and drivers) in 
the timing path must be considered. As an example, con- - 
sider a system in which the driver delay (from input transi- 
tion to start of rise or fall) is 15 + 5 ns. In addition, let 
each timing signal pass through two gates, each with delays 
in the range of 5 to 10 ns. Thus, for each timing signal gen- 
erated, a delay of 30 + 10 ns and a rise time of 20 + 5 ns 
is experienced. In Figure 1.17, each of the timing signals 
have been expanded to include the effects of these skews 
while still maintaining the timing requirements. All signals 
are assumed to have independent skews. 


The delay spread for gates within a single package will 
typically be much less than the spread for the logic family 
under consideration. In some systems, it may be possible to © 
take advantage of this reduced spread to achieve higher per- 
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formance. Using the skew figures listed above, the shortest 
read/write cycle becomes almost 700 ns in length. A read- 
only cycle could be as little as 580 ns long. Of course, each 
system designer must use figures characteristic of his own 
system. 


E.1 Delay Line Controllers 


There are several techniques for generating the timing sig- 
nals needed for the system. One such technique, shown in 
Figure 1.18, uses a delay line and controller to produce 
cycle timing signals. This circuit also includes control for 
refresh cycles, and generation of a ready/busy signal. At the 
end of each normal (non-refresh) cycle, the memory goes 
“ready”, signalling readiness to accept another memory 


PRECHARGE 


cycle request. A single shot multivibrator controls refresh 
cycle requesting timing — a cycle is requested every 60us. 
If such a refresh request is pending, at such time as the 
memory is ready, the refresh request is accepted and a re- 
fresh cycle is executed. However, the memory control con- 
tinues to indicate “ready”. Any normal requests received 
during the execution of the refresh cycle are acknowledged 
by the controller (by going “busy”), but are not executed 
until after completion of the refresh cycle. Thus, refresh 
cycles are visible to the normal requester only as occasion- 
ally longer access and cycle times. 


Cycle timing in the circuit of Figure 1.18 is established by 
launching a signal down the delay line. This signal, a 
square pulse of one half cycle in length, is used to generate 


CENT | 


CENABLE \ \ / J 


DATA OUT f XN 


DATA STROBE [ l 


Figure 1.17. 1103 System Timing 
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Figure 1.18. Delay Line Controller for Generating System Timing Signals 
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the various timing signals by performing logic at the various 
taps on the line. It should be noted that the most critical 
timing is associated with the overlap interval — i.e., the time 
between the turn-on of cenable and the turn-off of pre- 
charge. As shown in Figure 1.17, with the skews shown, the 
source timing signals must have a spacing of exactly 70 ns. 
To make the system requirements more realistic, several 
choices are open to the designer: 


a. Provide several delay line taps for precharge turn-off and 


cenable turn-on timing. The appropriate tap is selected for | 


proper overlap values. 

b. Generate overlap timing at (or closer to) the drivers to 
reduce the number of gates in the timing path. A suitably 
. delayed signal derived from cenable turn-on may be used to 
control precharge turn-off. A variation of this technique 
utilizes a discrete circuit to detect turn-on of cenable at the 
1103’s MOS levels themselves. The output of this circuit 
is used to control precharge turn-off. Figure 1.19 shows 
such a circuit. The delay line can usually be a single LC 
lumped stage. 
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Figure 1.19. Local Control of Tov 


The Intel 3207A driver provides input leads which can 
tolerate MOS level inputs. As a result, the logic of Figure 
1,19 can be implemented with fewer components when 
using the 3207A. 

c. More closely matched skews in the path from the timing 
generation to the cenable and precharge drivers by insuring 
equivalent gates are located in common package. If inte- 


grated circuit drivers are used, corresponding precharge and 
cenable drivers should be located in the same package when- 
ever possible. The Intel 3207A, quad level shifter and driver, 
is specified to have differential delays within one package 
not to exceed 10ns for equal loads of 200 pF. In addition, | 
rise and fall times of drivers within'a common package will 
not differ by more than 10ns under these conditions. . 
d. Faster logic, such as Schottky TTL, may be used in the 
critical paths of the timing control circuits. 


The access time for a system with timing as shown in Figure 
1.17 is at least 450ns plus any additional delays associated 
with starting the cycle and propagating the data to the mem- 
ory output. 


E.2 Shift Register Based Controller 


The circuit shown in Figure 1.20 uses clocked JK flip- 
flops wired as a shift register instead of delay lines to gene- 
rate timing for an 1103 memory. Operation is otherwise 


' similar to the controller of Figure 1.18. However, access — 


time is increased by the time which elapses from the occur- 
rence of the cycle request until the next clock pulse arrives. 
If timing similar to that of Figure 1.17 is used, the clock 
must operate at a 70ns period. All other requirements 
could be met if an 840 ns read/write cycle were used. How- 
ever, by using the tpyy; control circuit of Figure 1.19, 
much greater freedom is offered the designer. 


The shift register controller results in a longer memory cycle 
than the delay line controller because the timing cannot be — 
optimized with the shift register. When cost is more im- 
portant than speed, the shift register may be a more favor- 
able choice, as the flip-flops used typically cost less than the 
delay lines. 


Of course, many other controller designs may be used, as 
long as the timing requirements of the 1103 are met. 


E.3 Refresh Control 


The maximum time interval between accesses to a row of 
memory cells within the 1103 (t ppp) is specified as 2ms — 
over the normal operating temperature range. To guarantee 
that data is retained within the memory, at least one read or 
write cycle must be executed for each row of cells within 
any 2ms interval. As the rows are selected by address in- 
puts Ag through A¥, at least 32 memory cycles, one for 
each state of address lines Ag through A, must be executed 
in each 2ms interval. These cycles may in some cases re- 
sult from normal accessing; such as when a memory is used 
in a sequential-access mode. In other cases special refresh 
cycles must be executed. 


In any of the systems described above, during refresh cycles 
the address is derived from a refresh address counter, rather 
than from the normal address source. Figure 1.21 shows 
how address generation and switching may be accomplished. 
Only the 5 bits controlling address lines Ag through A, need 
to be switched during refresh cycles. In larger memories 
where cenable (and sometimes precharge) are decoded to 
act as a chip select signal, the decoding may be over- 
ridden during refresh cycles. In this way, the entire mem- 
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Figure 1.20. Shift Register Based Controller for Generating System Timing Signals 


ory is refreshed with the execution of only 32 cycles. When 
this type of “bulk” refreshing is used together with the 
“local” toyz, control circuit of Figure 1.19, the skews in 
turn-on of cenable drivers must be controlled to insure 
meeting the toy, specification during refresh cycles. The 
two controllers described above each execute a refresh cycle 


every 60us so that 32 such cycles are executed within 2ms. 


Refresh cycles are, of course, read cycles, and may, if the 
user desires, take advantage of the shorter cycle available 
when reading only. In small memories with slow cycles, 
it is sometimes convenient to use every other cycle as a 
refresh cycle. 


For some applications, the normal cycles executed may 
achieve meeting the refresh requirement. For example, 
random access memories used for maintenance of CRT 
displays are often operated in a “mostly sequential” access 
mode, or may have some recurring accesses which are se- 
quential. If these sequential cycles access three percent or 
more of the memory in any 2ms period, it may be possible 
_ to arrange the address connections such that these cycles 
alone are sufficient to maintain the data. 


Another example where normal use can maintain the data 
involves transfers from disk or drum and 1103 memory. 
In some cases, it may be desired that these transfers take 
place at full memory rate, with no cycles devoted to re- 
freshing. In most cases, addressing may be arranged so that 
the transfer itself refreshes the memory. Consider trans- 
fer between a 4096 word 1103 memory and a disk. Such 
transfers usually involve serial blocks of data. If the address 
bits include 1103 addresses Ay through A4, and the two bits 
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(1) Ry, Ry, Ro, Rg, Rg come from refresh address counter. 


(2) SA, SA4, SAz, SA3, SA, are addresses from memory 
address register (supplied from normal requesting circuits). 


Figure 1.21. Refresh Address Switching 


decoded to perform row selection, then any transfer of at 
least 128 consecutive words will refresh the entire memory. 
Using this organization, the controller generated refresh 
cycles may be inhibited during execution of such transfers 
without the loss of data. 


F. POWER CONSIDERATIONS 
F.1 Power Supplies 


Signal and power supply levels are important to the proper 
operation of the 1103. Speed is a function of both the Vsgs 
level and the clock amplitudes. In general, higher ampli- 
tudes or voltages result in faster operation. However, volt- 
ages outside of the specified operating ranges may result in 
marginal operation or more critical timing. Substrate bias 
Vpp also has an effect on performance. This bias improves 
noise immunity and prevents parasitic interaction within the 
device, but larger values of bias result in slower operation 
and lower output current. 


Between Vog and Vpp the 1103 presents the equivalent 
of a silicon junction diode, which is reverse biased under 
normal operating conditions. During power supply turn on, 
Vpp Should rise at least as fast as Vog and during operation, 
Vp should not fall below Vgg. To insure proper Vag reg- 
ulation and to guarantee these turn-on characteristics, Vaz 
is best generated by a 3-4V power supply in series with 
Vgg or by regulating Vog at 3-4V below Vpp - Because the 
1103s draw very little current from Vpp> some protective 
series resistance may be used, providing that Vpp is ade- 
quately bypassed to Vog at the 1103s, and that any level 
shifters connected to Vpp are connected to the power 
supply side of the series resistance. 


In spite of these precautions, shorting Vpz to ground may 
result in the Vog supply delivering destructive current 
through the 1103s. Again, a current limit on Vgg to provide 
protection, or deriving Vog from’ Vpgp is advisable. The 
current handling capability of the device is limited by bond 
wire and metallization current capability. Typical 1103s 
appear to have a voltage drop of about 0.7V with an addi- 
tional series resistance of about 52 and can be destroyed 
by energies in excess of .01 joule, or steady currents in ex- 
cess of 100mA. For cases where current limit protection 
cannot be used, a power supply crow bar may be desirable, 
to shunt the Vog supply in the event of a Vpp short cir- 
cuit or precharge driver failure. A Schottky barrier rectifier 
from Vgg to Vpp may also be used to draw Vgg to ground 
when Vpz is shorted. This type of rectifier has a lower 
voltage drop than the junctions of the 1103. Conventional 
silicon rectifiers also offer some, but reduced, protection. 
In each case, one or more rectifiers are connected such that 
the cathode is connected to Vpp and the anode to Vgg. 


F.2 Power Consumption 


Power consumption of the 1103 is associated primarily with 
current flow from Vgg to Vpp. This current varies signifi- 
cantly during execution of a memory cycle. The maximum 
consumption of power occurs during the interval when pre- 
charge is active, particularly when both precharge and cen- 
able are both active. Thus power consumption is a strong 
function of the precharge duty cycle. 


In Figure 1.12, the current flow from Vgg to Vpp is 
shown in addition to the driving signals. Table 1.5 itemizes 
the absolute maximum ratings and limits on power supply 
current for various parts of the cycle. When precharge is 


first applied, a surge of current flows associated with the 
charging of internal capacitances. The peak current is a 
function of precharge turn-on fall time, and is typically 
80-100mA @ 20ns. This current rapidly decays to a typi- 
cal values of 37mA (Ip ,) until cenable is made active. 
Most of this current is associated with the decoder circuits. 


During the overlap period (precharge and cenable both low), 
a somewhat higher current (Ippo) flows. When precharge 
is removed, the device current typically falls to below 11 mA 
(I ppg) remaining at this level through the remainder of the 
cenable active period. 


When cenable is removed, the current falls to a low level, 
under 4mA (Ipp,). This current is associated with the cir- 
cuits which generate internal signals R and R (see Figure 
1.11). The circuit generating R draws current when address 
line A 4 is low and that generator R draws current when in- 
ternal signal A, is low. Both currents may be blocked to - 
implement a low power data retention mode of operation © 
for 1103 memories. The technique is described in the 
standby power dissipation section (page 1-27). 


To estimate the power drawn by an 1103 memory system, ~ 
the memory array power, the level shifter dissipation, and 
the controller dissipation must all be considered. The 
power consumed by the memory array is a function of the 
memory cycle timing used, with precharge duty cycle and 
the number of devices made active within the ey the 

most important parameters. eacse 


Referring to the de power characteristics of the 1103, as 
itemized in Table 1.9 and for the cycle used as reference, 
1103’s executing memory cycles draw a maximum of 25mA. : 
average current. Devices which are not executing memory 
cycles may draw a maximum current (Ipp, of 4mA. 
Unless the memory cycle is altered such that the duty cycles . 
of the various clocks are significantly different than the 
reference cycle, these figures will apply. For example, 
consider a 4k x 16 (4 rows of 16 1103s each) memory, ine 
which only the selected raw is driven with precharge. 
Maximum average power supply current for the array is 
then 16 x 0.025 + 3x 16 x .004 = .593 amp. 


For memory cycles which differ significantly in timing, the 

average current per device executing a cycle may be esti-. 
mated from the duty cycles associated with precharge, . 
overlap, cenable, and transition periods. 


To a first approximation, the user can base current require- 
ments on precharge-on to precharge-off duty cycles (includ- 
ing transitions), precharge-off to cenable-off duty cycle, 
and the cenable-off to start of precharge turn-on. For the 
reference cycle of the 1103 data sheet, these times are as 
follows: 


full precharge interval: 
t pct tov. + 3 tr= 210 nsec 
precharge-off to cenable-off: 
tpovt tr 
or: tpwt twp +2 tr = 285 nsec 
cenable-off to precharge-on: 
t cp= 85nsec 
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Using these current figures for these intervals and neglecting 
the extra current during the overlap period results in an 
estimate within a few percent of the current actually ob- 
served. 


Executing memory cycles, the 1103 dissipation can approach 
400mW. Between cycles, dissipation is less than 64mW. 
A significant amount of power can be saved in an 1103 
system by applying precharge only to those rows of 1103s 
which are to be accessed. The same circuits used to decode 
address bits to drive cenable are also used to select which 
precharge drivers are to be operated. 


When precharge is decoded in the same manner as chip 
select, noise problems can result within the 1103 unless 
at least one of the following precautions is taken: 


1. Decode “write” in the same manner as precharge and 
_. cenable. 

“2, Apply a minimum 40ns “sliver” of precharge to the 
‘unselected devices in the array rather than fully block 
- -'. precharge. 

-- 3, Reduce the high level of the precharge driver so that it 
~” falls in the range of Vgg —0.5 to Vgg —1.5. 


It is also undesirable to operate the array in a mode where 
‘write cycles are continuously executed at the same address. 
‘This action may aggravate noise levels in the system. 


Arrays of 1103s draw large surges of current during normal 
- operation. Power supplies must be adequately regulated 
and bypassed at the memory array to insure proper main- 
__ tenance of the required voltages. Distribution of bypass 


RAMs 


capacitors is discussed in more detail in the section on 
printed circuit layout. 


A. major contribution to power dissipation in 1103 mem-. 
ory systems is made by the level shifter circuits. Level 
shifters of the types shown in Figures 1.14a and 1.14b 
dissipate some 400mW each when the output of the level 
shifter is low (Veg = 16V, Vaz —Vgg = 4V). A typical 
4k x 16 memory requires ten address level shifters, 16 
data level shifters, and from 6 to 12 clock level shifters de- 
pending upon the clock decoding scheme used. Because 
the clock level shifters have relatively low effective duty 
cycles, the de dissipation is relatively small, perhaps 300- 
400mW total. However, worst case dissipation for the 26 
address and data level shifters could exceed 10 watts. Some 
reduction in power may be achieved by gating the data level 
shifters such that they deliver low outputs only during the 
cenable period of write cycles. In large systems, addresses 
may be held high on all but selected memory modules in 
order to conserve power. 


In level shifters which are driving large capacitive loads, the 
power associated with charging and discharging the capaci- 
tances must also be considered. This power, given by the 
relationship P = fCVv? (see above in the section on level 
shift circuits) is most significant in clock driver circuits. 
However, in high performance, heavily loaded address 
drivers it is very important to prevent switching transients 
from producing an effectively higher frequency and corres- 
pondingly higher dissipation. For example, in a 4k x 16 
memory, using a 700nsec cycle, addresses would typically 
change at most once per cycle, corresponding to a maxi- 


Table 1.9. Absolute Maximum Ratings and Power Characteristics for the 1103 


1103 ABSOLUTE MAXIMUM RATINGS* 


Temperature Under Bias 

Storage Temperature 

All Input or Output Voltages with 
Respect to the Most Positive 
Supply Voltage, Veg 

Supply Voltages Vop and Veg 
with Respect to Vee 


0°C to 70°C 
—65°C to +150°C 
—25V to 0.5V 


—25V to 0.5V 
Power Dissipaticn 1.0 W 


“COMMENT: 


Stresses above those listed under ‘Absolute Maximum 
Rating’ may cause permanent damage to the device. This ts 
a stress rating only and functional operation of the device 
at these or at any other condition above those indicated in 
the operational sections of this specification is not implied. 
Exposure to absolute maximum rating conditions for ex- 
tended periods may affect device reliability. 


eee ee 00€._ OO —————— 


1103 POWER CHARACTERISTICS 
T= O0°C to+70°C, Vin = 16V + 5%) (Veg — 


Vga SUPPLY CURRENT 


SUPPLY CURRENT DURING Tope 


ALL ADDRESSES = OV 
PRECHARGE = OV 
CENABLE = Vgg : Ta = 25°C 


$ 
SUPPLY CURRENT DURING Toy 


ALL ADDRESSES = OV 
PRECHARGE = OV 
CENABLE = OV: T, = 25°C 


| SUPPLY CURRENT DURING Tpoy 


PRECHARGE = Veg 
CENABLE = OV: T, = 25°C 


SUPPLY CURRENT DURING Top 


PRECHARGE = Veg 
CENABLE = Vesi Ta = 25°C, 


lop Ae? | AVERAGE SUPPLY CURRENT 


NOTES: 1. These values are taken from a single pulse measurement. 


CYCLE TIME = 580 ns; PRECHARGE 
WIDTH = 190 ns; Ta = 25°C 


2. This parameter is periodically sampled and is not 100% tested. 
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mum frequency of half the memory cycle rate. Using a 
value of 16 volts and 448pF, the dissipation associated with 
capacitance charging is about 84mW. If, however, “glitches” 
result in two or more additional address transitions prior 
to starting a cycle, this figure may be tripled or more. 


In larger memory systems built with the 1108, the total 
system power may be reduced by insuring data and address 
level shifters in unaccessed modules remain in a low dissi- 
pation condition. 


F.3 Standby Power Dissipation 


Low power data retention (LPDR) refers to a mode of 
operation for an 1103 memory whereby data is retained 
within the memory with minimum power dissipation. With 
a suitably designed 1103 memory system, the power required 
to retain data can be reduced to below 5 watts per million 
bits of memory. This mode is used to make the memory 
appear non-volatile to the user. A small battery pack may 
be used to maintain the memory content in the event of 
the failure of the external power source. In the low power 
data retention mode, the data is not considered accessible. 
Only that power necessary for the memory to retain the 
data is consumed. When normal power is restored to the 
system, the memory is switched back to normal mode, but 
does not have to be reloaded before the system is usable. 
This feature is particularly useful for process control appli- 
cations and for small machines that do not have disk or 
tape back-up storage. 


Although in normal operation the 1103 has a potential de 
current path from Vgg to Vpp even when cycles are not 
being executed, it is possible to design a system such that 
this path is blocked when cycles are not being executed. 
The ability to block this path permits very low standby 
power operation to be achieved. _ 


To maintain data within an 1103 system, it is sufficient 
to execute the 32 required refresh cycles in each 2msec 
period. Rather than space the refresh cycles 60sec apart 
as is common in normal operation, when running in 
LPDR mode the refresh cycles are more conveniently exe- 
cuted in a burst of 32 cycles every 2 msec. With this mode 
of refreshing, no memory for refresh address selection is 
needed between bursts, so that the timing and refresh ad- 
dress circuitry may be turned off between bursts. This 
switching-off of power to the controller reduces the standby 
power associated with the controller logic. Vgp and Veg 
power supplies must be maintained at the memory through- 
out LPDR operation, although the current paths through 
the 1103 are blocked between bursts. To biock I Dp4 cur- 
rent in the 1103s between bursts, the last cycle of the 
burst should have address A, low. Subsequent to this 
cycle, A, is raised high for the between burst interval. 


Level shift circuits should be designed so that their power 
consumption can be made negligible between bursts of 
cycles, The level shifter should also maintain the output 
levels at Vog with the TTL supply turned off. To prevent 
current flow from the Vgg supply to the Vgg supply, the 
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Vp supply to the level shifters should be dropped to Vgg 
voltage between bursts. However, normal Vaz voltage 
must be applied to the 1103s, 


Those level shifters of Figure 1.14 which utilize the SN7406 
may not operate properly with the TTL supply off, for the 
clamping action of an internal transistor across the emitter 
base diode of the output transistor may be necessary for the 
output device to sustain the 16V supply. However, the cir- 
cuit of Figure 1.14C has been successfully used in 1103 
memories operated in LPDR mode. 


In LPDR mode, the Vpp> Vg and controller TTL supplies 
are maintained by batteries. For best results, these batteries 
are located on the unregulated side of the power supply. 
When main power fails, the memory is switched to LPDR 
mode, but voltages within the array do not experience 
switch-over transients. The controller TTL supply is switch- 
ed on to the logic only during execution of the refresh 
bursts, 


One form of control circuit necessary to implement LPDR 
mode is shown in Figure 1.22. A timer circuit realized by 
an RC network and difference amplifier established the 
2msec period between bursts. When this circuit times out, 
or when main power is switched on, the controller power is 
switched on. A power-on reset pulse initializes the con- 
troller and refresh address counter and resets the refresh — 
mode control flip-flop to “burst” mode. After the power- 
on reset is removed, the refresh mode control flip-flop 
requests refresh cycles until all refresh addresses have been 
exercised. The carry-out from the refresh address counter 
sets the refresh mode control flip-flop to normal after the 
32 refresh cycles have been executed. This action returns 
the memory to the mode where refresh cycles are executed 
every 60sec, and makes it available for normal cycle re- 
quests. However, if main power is not on, the setting of 
the refresh mode control flip-flop resets the 2 millisecond 
refresh interval timer and turns off the controller power. 
With controller power off, the 2 millisecond timer again 
charges for 2 msec then switches on controller power, and _ 
the process is repeated. Turning main power on or off also — 

requests execution of a burst of cycles, so that at no point 
does any location ever experience more than. 2 msec be- 

tween refresh cycles. | 


In a typical 4k x 16 system, the following power levels 
were experienced: 


Normal 
Operation ‘ LPDR Mode 
Vpp (+19.5V) 5.7 W .070 W 
Vgg (+16V) 3.2 W 145 W 
Voc (+5V) 3.6 W .090 W 
Voec— (-5V) 0.4 W = 
16.9 Watts .305 Watts 


Vcc is the +5V supply backed up by batteries while Vogo 
supplies power to circuits required only for normal opera- 
tion. V¢c_ is a supply used only for the SN75107 sense 
amplifiers. 
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Figure 1.22. LPDR Circuit 


The power dissipation in LPDR mode is essentially pro- 
portional to the number of refresh bursts executed per 
-. second. This rate is limited to 500 per second or more by DATA IN 

the 2 msec maximum interval between refreshes. However, 
the 2 msec figure applies to operation over the full range 

from 0 to 70°C. Typical parts can tolerate much longer 
. intervals between refresh cycles when operating at room 
- CENABLE 
operating temperature of the memory and vary the re- 
fresh interval accordingly. Standby power levels under one 
watt per million bits have been reported for memories at 
room temperature.* 


>b > D> 
oo +} = 


PRECHARGE 
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F.4 Printed Circuit Layout Considerations 


> 
© 


The pin connections of the 1103 permit an array to be laid 
out on %” x 1” centers if leads are run between pins of the 

_ dual in-line package. To achieve this packing density, 
leads running between pins need only be used on the com- 
ponent side of the board. Figure 1.23 shows a layout for 
use with the 1103. 


At the present time, there seem to be several schools of DATA OUT 
*D, Appelt — Providing Non-volatile LSI Memory, Digest of 


Papers, COMPCON 72, Printed Circuit Layout Considera- 
tions, Figure 1.23. PC Array Layout 
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thought concerning the use of boards with ground planes or 
ground and Vgg planes vs. two-sided boards. Although 


empirical evidence seems to indicate that the ground and 


Vgg Plane construction is superior in terms of noise within 
the system, successful two-sided layouts have been achieved. 
To keep noise low in two-sided layouts, a number of rules 
should be observed. 


1. Occasional vertical busses on Vgg and Vpp make the 
power feed more closely approximate a grid reducing power 
supply noise. 


2. Adequate bypassing of the Vgg supply to Vpp is manda- 
tory. At least .05ufd of low inductance capacitance per 
-device should be used. It is usually sufficient to use a 
_ 0.5ufd ceramic for every 8 to 10 devices in the array. One 
such capacitor is used at the end of each row of devices. 
The capacitors should be very close to the array or within 
the array. In general, no part should be further than a few 
inches from a bypass capacitor. 


Referring to figure 1.12 and Table 1.9, it may be noted 
that the 1103 draws a rapidly fluctuating current during 
execution of a memory cycle. Even if it were possible to 
design a regulator which could respond to these rapid 
fluctuations, the inductance of the leads from the power 
supply would result in poor regulation at the memory. 
(Most regulator circuits have response times of several tens 
of microseconds.) It is the function of the bypass capacitors 
to maintain constant voltage throughout the execution of a 
cycle, 


3. The Vgp supply should be bypassed to Vgg rather than 
Vpp » as the difference between Vog and Vp is the more 
important parameter. Bypasses to both Vgg and Vpp are 
commonly used, Although the 1103 draws little or no DC 
current from the Vppz supply, transient currents do flow 
.during clock transitions. Bypassing insures adequate track- 
ing of the supplies, and reduces fluctuations due to capa- 
citive coupling within the 1103. A typical bypass circuit is 
shown below: 


4. Clock drivers and address level shifters should be lo- 
cated as close to the array as possible, and should be con- 
nected to the array with short leads. The array may other- 
wise resonate with the series lead inductance and produce 
ringing. 

Excessive ringing can seriously impair memory operating 
margins, Series damping resistors and clamp diodes which 
prevent 1103 input lines from rising more than a volt. above 
the Vog supply may be utilized to impreve drive signal 
quality, but at the expense of some increase in rise time. 


5. When low power data retention is utilized, all devices in 
the array simultaneously execute a burst of 32 cycles. As 
a result, power supply surge currents are even more severe 
when this mode of operation is used. Because power sup- 
ply regulators do not respond rapidly enough, the supply 
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voltages must be maintained at the array by using additional 


bypass capacitors, For LPDR mode, an additional-1 to 2ufd 
per device should be used in addition to the ceramic bypas- 
ses described above. For example, this capacitance might 
take the form of a 100ufd tantalum electrolytic for a 64 
device array. For example, this capacitance should be lo- 
cated immediately adjacent to the array. 


6. With the availability of inexpensive integrated circuit 
regulators, local regulation may be used to advantage in . 
some 1103 memory systems, Current limiting, and the 
shutting down of Vgg in the event of Vpp short may be 
more readily achieved with local regulation. 


7. The large currents which flow during clock and address 
transitions can be troublesome if poor layout practices are 
used. These currents should not be allowed to flow to TTL 
signal or ground lines or inferior drive waveforms and even 
oscillation may result. The connections from level shifters 
to the array provide for isolation of the array current paths: 
from the TTL ground lines. 


8. In the section covering sense amplifiers, the use of a 

dummy line to achieve a balanced system was mentioned. 

Although use of this technique is optional, more care must 

be used in unbalanced systems to prevent coupling of sig- 
nals from data output busses back around through clock 

drivers, etc. High ground impedances may result in circuit 

instabilities. 


G. APPLICATION EXAMPLES 


G.l 1kx8 Memory 


The 1103 has been used in memories covering a wide range 
of sizes. Systems as small as 1k x 8 to over a million bits can 
be achieved economically and conveniently using the 1103. 


Figure 1.24 shows a photograph of a 1k x 8 memory using 
8 1103s with a clock counting controller. The system is 
synchronous, with alternate cycles reserved for refresh 
cycles. A control line permits inhibiting the refresh cycles, 
Some power can be saved by enabling refresh cycles for 
only 40-50usec out of the 2 msec period. The board in 
the photo provides address and data output latches. Level 
shifters similar to those of Figure 16a are used for address 
and data. Clocks use a driver similar to that of Figure 16b 
although that of Figure 16c could also have been used. 


Most of the design compromises for the 1k x 8 were made 
in favor of minimum cost. As a result, using medium quan- 
tity prices for the peripheral logic, the peripheral parts cost, 
including the board, amounts to less than four tenths of a 
cent per bit. In high volume production quantities, the 
overhead cost should be much less than this figure. 


Memories in the order of 1k x 8 in size find application in 
cathode ray display terminals, when the designer wants the 
advantages of random access memory over conventional 
serial memory. Other applications include buffers for peri- _ 
pheral equipment, calculator memories, etc. 
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1k x 8 1103 Memory 
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Figure 1.25. 4k x 18 1103 Memory Complete with Controller 
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G.2 4k x 18 Memory 


A very popular application for the 1103 is as a mini-com- . 


_ puter main frame memory. Figure 1.25 shows a photograph 
of a 4k x 18 memory complete with delay line controller 
similar to that of Figure 1.20. For level shifters and clock 
driver, the system utilizes the Intel 3207A integrated quad 
clock driver. This board also includes address and data out- 
put latches. The SN75108 is used for the sense amplifiers. 


For some memories, cost savings may be achieved by using 
one controller for a number of memory modules. Figure 
1.26 shows a photo of an example of a typical 1103 appli- 
cation. This memory board is available as part of a basic 
system that is being offered by Intel’s Memory Systems 
Division. The basic memory consists of two plug-in boards: 
a memory board (MU) and a control board (CU). The con- 
trol board is capable of operating up to 8 MUs in configu- 
rations of 32k words by 18 bits or 65k words by 9 bits. 
The overhead cost for the system approaches that of the in- 
dividual module in large systems. In large systems, the 
designer may also wish to utilize some of the power saving 
schemes described above. 


The large 1103 memory system can offer the system designer 
some unusual extra features at very little additional cost. 
The addition of address and data registers to each module 
permits multiple simultaneous access to the memory (al- 
though only one access may be made to each module), Ata 
slight increase in memory cycle, the data registers may be 
eliminated, and the various modules polled via the “‘strobe”’ 
input to the sense amplifiers. By using additional write 
clock drivers, a memory can be structured to have a wide 
data path for reading, and a variable width data path for 
writing. 
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H. PROTECTION AGAINST CATASTROPHIC. 
DAMAGE 


As with any semiconductor component, 1103s can be 
damaged by misuse, or mis-applied voltages. The compo- 
nent should be protected from such misuse during shipment, 
handling, and when installed in the system. 


MOS circuits are characterized by high impedances, and 

therefore are capable of being charged to high voltages by - 
static charges. The gate circuits of MOS transistors are 

subject to destructive breakdown if excessively charged. 

The 1103 has an effective gate protection circuit for all 

input connections, and requires no elaborate precautions in 
normal use. However, some environments are subject to 

extreme buildup of static charge, and are capable of re- 

leasing sufficient amounts of energy to damage any semi- 

conductor component. MOS components in particular 

should be protected from these static charges. Some pre- 

cautions which are easy to implement and yet which are 

quite effective include: 


1. Carrying components in conductive trays, such as metal 
or foil-lined pans. 


2. Having personnel touch ground, the chassis, or the car- 
rier tray before picking up components, 


3. Avoiding high-static materials and fabrics in work areas. 
In a circuit or tester, the part should not be subject to high 
voltage spikes. Although voltages somewhat in excess of 
the maximums may not cause damage if applied for short 
periods of time, voltages near the maximums should not be 
applied for long periods of time. Operation under excessive 
power dissipation conditions may also impair ere 
reliability. 


Figure 1.26. in-10 Memory System 
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In a system, adequate cooling air must be provided to pre- 
vent excessive temperature rise. As in the case of bipolar 
and static memory components, it is possible to install 
1103s at packing densities exceeding those typical of TTL. 
In arrays with undecoded precharge, power dissipation per 
package can be higher than the typical TTL dual in-line 
package. 


Power dissipation is a function of precharge duty cycle. If 
a precharge driver fails such that precharge remains perma- 
nently low within the array, excessive dissipation may result. 
Some form of overcurrent protection for the Vogg supply 
is recommended to prevent potential damage due to this 
type of failure. 
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I. SUMMARY 


The 1103, a 1024 bit dynamic-MOS random-access read- 
write memory, is now regarded an industry standard. 
Any memory system designer should be aware of the ad- 
vantages of using this component. The foregoing text was 
prepared to help the potential user understand the operation 
of the 1103, and to provide the designer with the informa- 
tion he needs to use the 1103 in practical systems. 
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INTRODUCTION 


The devices discussed in this section are also random access 
memories, and may be wired in arrays much like the fixed 
address read-write memories of Part I. However, all of the 
products in this group are characterized by having the data, 
once it is entered into the array, essentially unchangeable 
by the system using the memory array. Entering data into 


Read Only 
Memories— _ 
ROMs and PROMs 


the array is known as “programming” the read only mem-.. 
ory (ROM). 


Two categories of devices are discussed here: 1. mask-pro- ' 
grammed read-only memories, which have the data entered 


during the manufacture of the components, and; 2. field \ : 
programmable read-only memories, which allow the user to - =" 


enter data into the devices in the field. Field programmable _ 
read-only memories may be erasable or non-erasable. Eras. - 


able read-only memories provide some means for removing. 


the data from the array and making the device again avail- 
able for field programming. 


Read only memories (ROMs) are used for a wide variety 
of applications. They may be used to perform character 
generation for TV displays, code conversion, to perform 
the equivalent of logic function in controllers, and to store 
programs (particularly certain commonly used routines such 
as loaders for computers, etc.). Primary advantages of 
ROM are its non-volatility and low cost, usually much low- 
er than read-write memory. The main disadvantage, partic- 
ularly for mask programmed ROM, is the inability to change 
erroneous data. 


Table 2.1 lists the types of ROM that will be discussed in 
this section. All of the ROMs listed are general purpose 
devices, but vary in access time, eraseability, configuration, 
and programming method. 


All of the ROMs of Table 2.1 include one or more chip 
select leads and may be wired in arrays as were the read- 
write memories of Part I. All of the buffering considerations 
for p-channel MOS RAMs apply to p-channel MOS ROMs 
with the exception that there is no “write” lead to be 
bussed in the array. TTL input and output considerations 
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Table 2.1. Intel ROMs and PROMs 


p-channel MOS 
p-channel MOS 
p-channel MOS 
p-channel MOS 
p-channel MOS 
Schottky Bipolar 
Schottky Bipolar 


Schottky Bipolar 


are also similar to those for static p-channel MOS read- 
write memory. | 

Power dissipation considerations are also similar to those 
for read-write memories. However, some systems may 


Programming Programming Access Time 
Part Number Technology Technology Erasable (nsec) 


256 x 8 1000 
256 x 8 
256 x 8 
256 x 8 
256 x 8 
256 x 4 
512 x 8 or 
1024x 4 
256 x 4 


operate with switched power to reduce overall power con- 
sumption. Because ROMs and PROMs are non-volatile, 
power may be turned on and off without altering memory 
content. 
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ELECTRICALLY PROGRAMMABLE 


MOS ROMs 


The 1602, 1602A, 1702 and 1702A are 256 words by 8 
bit electrically programmable ROMs ideally suited for uses 
where fast turn-around and pattern experimentation are 
important. The 1702 and 1702A are packaged in a 24 pin 
dual in-line package with a transparent quartzlid. The 
transparent quartz lid allows the user to expose the chip to 
ultraviolet light to erase the bit pattern. Therefore, unlike 
for a metal mask ROM where a pattern cannot be changed, 
a new pattern can then be written into the devices. 


The 1602A and 1602 are packaged in a 24 pin dual in-line 
package with a metal lid. The 1602A and 1602 are intended 
for applications where field programmability and package 
hermeticity are desired. The 1602A and 1602 are not 
erasable due to the metal lid. 


There is no electrical difference between the 1602A/1702A 
and the 1602/1702 in the read mode. However, the 1602A/ 
1702A offers a faster programming time than the 1602/1702. 
The 1602A/1702A will program in 2 minutes versus 20 
minutes for the 1602/1702. The power dissipation during 
programming is also greatly reduced in the 1602A/1702A. 
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A. PRINCIPLES OF OPERATION FOR 
FLOATING GATE DEVICES 


The 1602, 1602A, 1702, 1702A all utilize the same principle 
for data storage. The storage medium for each data bit is a 
silicon gate MOS transistor. The gate of the transistor is 
left floating, completely surrounded by oxide. As manu- 
factured, the floating gate is uncharged, and all of the 
transistors are in their “off” or non-conducting state. How- 
ever, avalanching one of the junctions associated with the 
transistor produces electrons with sufficient energy to tra- 
verse the thin gate oxide layer. These electrons reach the 
floating gate and charge it negatively. With sufficient nega- 
tive charge on the gate, the MOS transistor will turn on and 
enter the conducting state. 

Once the floating gate is charged, considerable energy is 
necessary to give the electrons sufficient velocity to escape 
the gate. Typically in excess of 5eV is necessary. Sunlight 
and ordinary illumination are insufficient, and the thermal 
excitation associated with normal operating temperatures is 
far too small. X-rays and stray radiation may discharge the 
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gate, but only at dosages far in excess of those fatal to 
humans. However, special short-wave ultraviolet sources 
very effectively erase the devices by sufficiently exciting 


the trapped electrons. The 1702 and 1702A have trans- 
parent quartz lids to facilitate erasure. 


B. PROGRAMMING THE 1602/1702 


. The 1602 and 1702 are 2048 bit static field program- 
mable ROMs. As supplied, all data bits are initially ones 
(output high), with the programming operation forcing 
ones to zeroes or leaving ones unchanged. Ultraviolet 
erasure (of the 1702) restores the data to all ones. 


Figure 2.1 shows the waveforms required to program the 
1602 or 1702. During programming Vcc is held at ground 
and Vpp is biased to +12V. All voltages shown in Figure 
2.1 are measured with respect to the Vcc terminal. 


During programming, address logic level “0’’ (corresponds 
to address low during reading) is —40 to —48 volts and 
address logic level “1”? is approximately zero volts. To 
enter data at a specified address, the address signals are 
applied to the address leads, data to be entered is applied 
to the output leads, and the power supplies Vpp and 
VaG and program lead are pulsed negatively. To program 
a location such that it will deliver a low when reading, 
during programming the data input terminal must be held 
high (0 volts with respect to Vpp) and to enter a logic 
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~ 40 TO —48 
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“1” (high output when reading) the data terminal must be 
held at —40V during programming. 


Once address and data are stable, the power and program 
pulses may be applied. The power leads, Vpp and Vea; 
must be on (at their negative extreme) about lus before 
the program pulse is applied and should remain on for lps 
after the program pulse is removed. Duty cycle for these 
pulses should not exceed 2% or the device may be destroyed 
by excessive power dissipation. A typical programming se- 
quence consists of applying five 20msec pulses 1 second 
apart. 


C. PROGRAMMING THE 1602A/1702A 


The 1602A and 1702A are 2048 bit static field-program- 
mable ROMs. ‘As supplied, all data bits are initially zeroes 
(output low), with the programming operation forcing 
zeroes to ones or leaving zeroes unchanged. Ultraviolet 
erasure (of the 1702A) restores the data to all zeroes. 


The 1602A and 1702A require a different programming se- 
quence than the 1602/1702. Where the 1602/1702 draws 
a peak current during programming in the order of 1.2A, 
the 1602A/1702A draws approximately 200mA. As a re- 
sult a much higher programming duty cycle may be used, 
and programming time is reduced from the 20 minutes for 
the 1602/1702 to 2 minutes for the 1602A/1702A. 


Figure 2,1. Programming Waveforms for 1602, 1702. 
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Figure 2.2. Programming Waveforms for 1602A, 1702A 


Figure 2.2 shows the waveforms for programming the 
1602A/1702A. During programming Vcc should be held 
- at ground and Vpp should be held at +12 volts. Address 
levels are approximately —40 volts for a logic “0” and 
approximately zero volts for a logic “1”. Note that these 
levels are larger in magnitude but in the same polarity sense 
as those used for reading from the memory: 
logic “0” < Voc —4.2, logic “1” 2 Voc —2, 
where Voc = 5V + 5%. 

When programming, the negative going power supplies must 
be pulsed. Vpp is pulsed to —47V + 1V and Vgq to—35 
to—40V. Before Vpp and Vgqg are turned on, the comple- 
ment of the address to be programmed should be applied. 
After the power has been applied for at least 25us, the 
address must be returned to its true form. Some 10 or 
more us after the address has reached its true form and at 
least 100us after turning on power, the 3ms program pulse, 
at —47 + 1V, may be applied. During the interval when 
Vpp is applied, data signals must be applied to the data 
output lines. A data level of approximately zero volts will 
result in the location remaining unchanged, while a level of 
' —47 + 1V will program a logic “1” (high output in read 
mode). After the program pulse is turned off, the Vpp and 
Vqaq voltages should be turned off. This turn off should 
occur from 10 to 100 microseconds after removal of the 
program pulse. 


For best results, the 1602A/1702A should be programmed 
by scanning through the addresses in binary sequence some 
32 times. Each pass repeats the same series of programming 
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pulses. The duty cycle for applied power must not exceed 
20%. As a result, each pass takes about 4 seconds, with the 
32 passes taking just over 2 minutes. 


D. PROGRAMMERS 


Although manual programmers may be constructed for the 
1602A/1702A, the circuitry is fairly complex and manually 
programming a PROM of this capacity is quite tedious. 
There are several automatic programmers capable of pro- 
gramming the 1602A/1702A. One is an assembly using the 
Intel SIM4-03 or SIM8-01 micro computer board which 
executes a set of micro computer instructions for program- 
ming the device. It is assembled with the MP7-03 program- 
mer card and an ASR-33 teletype. 


Figure 2.3 is a diagram of the SIM4-03 or SIM8-01/ 
MP7-03 programming assembly. Figure 2.4 shows the com- 
plete programming system with the boards (SIM4-03 or 
SIM8-01/MP7-03) and control chassis (MCB4-20 for the 
SIM4-03 and MCB8-10 for the SIM8-01). For further in- 
formation about these units, refer to the MCS-4 and 8 
Users Manuals. 


E. ERASING THE 1702 OR 1702A 


The 1702 and 1702A ROMs may he erased by exposure to 
high intensity short-wave ultraviolet light at a wavelength of 
2537 A. The recommended integrated dose (i.e., UV inten- 
sity x intensity time) is 6W-sec/em2. The devices are 
made with a transparent quartz lid covering the silicon 
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Figure 2.4. MCS-4 and MCS-8 Programming System 


die. Conventional room light, fluorescent light, or sunlight 
has no measurable effect on stored data, even after years of 
exposure. However, after 10 to 20 minutes under a suitable 
source, such as the Ultraviolet Products UVS-54 or UVS-52 
(Ultraviolet Products, 5114 Walnut Grove Avenue, San 
Gabriel, California 91778), the device is erased to a state 
of all ones (1702) or all zeroes (1702A). It is recommended 
that no more ultraviolet light exposure than that necessary 


to erase the 1702/1702A should be used to prevent damage 
to the device. 

CAUTION: When using an ultraviolet source of this type, 
one should be careful not to expose one’s eyes or skin to’ 
the ultraviolet rays because of the damage to vision, or 
burns which might occur. In addition, these shortwave 
rays may generate considerable amounts of ozone which is 
also potentially hazardous, 
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PROMs-FIELD PROGRAMMABLE BIPOLAR PROMs 


The 3601 is a bipolar 256 word by 4 bit Read Only Mem- 
ory which is programmed in the field by blowing a poly- 
crystalline silicone fuse for each bit to be programmed (to a 
logic “1”). The 3601 is guaranteed to have a maximum 
access time of 70ns over the 0°C to 75°C operating range. 
It is pin compatible to the 3301A metal mask ROM. 


The 3601 may be programmed using the basic circuit of 
Figure 2.5. Address inputs are at standard TTL levels. 
Only one output may be programmed at a time. The out- 
put to be programmed must be connected to Vcc through 
a 300Q resistor. This will force the proper programming 
current (3-6mA) into the output when the Vcc supply 
is later raised to 10V. All other outputs must be held at a 
TTL low level (0.4V). 


PROGRAMMING 
PULSE GENERATOR 
(See Figure 2.6) 


WORD 
SELECTION 


Figure 2.5. 3601 Programming 


The programming pulse generator produces a series of pulses 
_to the 3601 Vcc and CS2 leads. Vcc is pulsed from a low 
of 4.5 + .25V to a high of 10 + .25V, while CSg is pulsed 
from a low of ground (TTL logic 0) to a high of 15 + .25V. 
It is important to accurately maintain these voltage levels, 
otherwise, improper programming may result. 
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The pulses applied must maintain a duty cycle of 50 + 10%. 
and start with an initial width of 1 (+ 10%) us, and increase 
linearly over a period of approximately 100 ms to a maxi- 
mum width of 8 (+10%)us. Typical devices have their fuse 
blown within 2ms, but occasionally a fuse may take up to 
400 ms. 


During the application of the program pulse, current to 
CS» must be limited to 100mA. The output of the 3601 
is sensed when CSo is at a TTL low level output. A pro- 
grammed bit will have a TTL high output. After a fuse is 
blown, the Vcc and CS pulse trains must be applied for 
another 100us. 


One circuit which can be used to generate this pulse train is 
shown in Figure 2.6, while the characteristics of the pulse 
train are shown in 2.7. 
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NOTES: 


1, High input impedance sense amplifier, to avoid shunting the programming current which must 
flow into the selected output of the D.U.P. 


2. Capacitor, to increase rise times. 


3. Delay network, to allow the D.U.P. to recover before checking the level of the selected output. 


4. One shot, to sustain programming for 100us after a ‘1’ level has been detected at the output 


of the D.U.P. 


5. Delay network, to inhibit PROGRAM START until the ramp generator has been reset. 
Unless otherwise noted, all resistors are in kQ, % watt, all NPN transistors are 2N3646, and all 


diodes are 1N914, 
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Figure 2.6. 3601 Programmer 
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Figure 2.7. Pulses During Programming 
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NOTES: 
1. Duty cycle: 50 + 10% 
2. Rise and Fail Times (measured from 


10-90% points): 100ns min., 
303ns. max. 

3. Tep (Chip Disable Hold Time): 
100ns min. 


4. Phase shift between Voc and CS2: 
50ns max. @ 7.5V. 
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PREPARING DATA FOR ROMs and PROMs 


Entering the data into a ROM is called “programming” the 
ROM. Whether programming a field-programmed device or 
preparing to order masked-programmed devices, the data to 
be entered into the array must be prepared and very 
carefully checked (although erasable devices are more 
forgiving of errors than the other types). These data are 
usually in the form of truth tables or their equivalent, such 
as suitably prepared paper tapes, etc. 


In the case of mask-programmed devices, the truth tables, 
tapes, computer punched cards, etc. are submitted to the 
integrated circuit manufacturer, who uses them to prepare 
at least one of the photo masks used for processing the 
wafers of read-only memory devices. He also uses this data 
to prepare test programs for testing the wafers after they 
are through the wafer fabrication process. Properly funct- 
ing devices are assembled, tested, and delivered to the 
customer, 


Although the mask used for ROM programming is usually 
the metal mask, chosen because it is used at a late stage in 
the processing, the turnaround time for mask-programmed 
ROM’s is still often several weeks. In addition the user 
incurs a significant charge for the mask and special handling 
and testing necessary. 


Because of the time and _ cost associated with 
mask-programmed ROM’s, most potential users develop and 
check out their truth tables using read-write memory 
simulators or field-programmable ROM’s. 


Data for field programmable ROMs is often prepared in the 
same form as for mask-programmed devices. Intel’s SIM4-03/ 
MP7-03 microcomputer set ROM programmer requires ini- 
tial inputs via a suitably prepared paper tape. 


When preparing truth tables and paper tapes, it is extremely 
important that careful attention be paid to logic level 
definitions, address connections, etc. A ROM is of little use 
if the pin assignments don’t agree with those required by 
the system for which it was intended. 


Intel has adopted the following conventions for paper tape 
preparation: 


1. Tapes are prepared using ASCII Code as produced by an 
ASR-38 teletype terminal. 
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2. The tape contains a leader of at least 6’’ of blank or 
rubbed-out tape. 

3. Following the leader, the tape contains one data record 
for each ROM word (256 records for 1302, 1602, 
1602A, 1702, 1702A, 3301A, 3601; 512 or 1024 for 
the 3304). 

4. Each data record starts with the letter B, and is 
immediately followed by a string of P’s and N’s. There 
is one “‘P”’ or “‘N”’ for each bit in the word. The last P 
or N of the record is followed by an F. 

5. The first data record represents the contents of address 
“0”? i.e. all address lines at their low or negative 
extreme. This record is followed by a record for address 
1(all low but address AO), address 2 (all low but address 
Al) etc., in (binary) sequence. The last record 
represents the data for the location selected when all 
address lines are high or at their positive extreme. 

6. The data records contain P’s where the corresponding 
data bit is to be a high or at the most positive output 
level, and an N where the output is to be low or at the 
most negative output level. 


The first P or N of each data record corresponds to the 
most significant or highest numbered output bit, while 
the last P or N of each record corresponds to the value 
for the lowest numbered output bit or least significant 
output bit. (Note that these numbers apply to Intel’s 
signal labels, not to package pin numbers). Between the 
highest and lowest output line labels, the remaining 
outputs are entered in descending sequence. 

7. Following the last data record, a trailer of at least 6’’ of 
blank or rubbed-out tape must appear. 

8. No character other than B, P, N, or F should appear in a 
data record. However, between data records (after F 
and before B) any other characters except B’s and F’s 
may appear. It is reeommended that a carriage return 
and line feed be inserted after every fourth data record 
to allow listing of the tape on an ASR-33 teletype. 


When ROMs or PROMs are to be programmed at Intel, the 
data information may also be sent in the form of computer 
punched cards. See the Intel Data Catalog for the format. 


ROM IMPLEMENTATION OF 


A. COMBINATORIAL CIRCUITS 


Digital circuits are often divided into two categories: 
combinatorial and sequential. Combinatorial circuits have 
no internal storage elements. As a result, the output signals 
are functions only of the inputs supplied at the time the 
output is measured (neglecting propagation delays). A ROM 
may be used to generate combinatorial functions when the 
number of input signals is not excessive. For example, a 
256 word by 4 bit ROM has 8 input leads (addresses) and 4 
output leads and so can be used to generate any 4 
combinatorial functions of 8 variables. Additional 
functions may be generated by adding more 
ROM’s — doubling the number of ROM’s doubles the 
number of functions which can be generated. 


Expanding the number of input variables is much more 
costly, however. Additional input variable may be decoded 
to operate chip selects just as additional addresses inputs 
are decoded in a memory array. However, each additional 
input variable doubles the amount of ROM required. 


Various authors have expressed the option that 8 to 16 bits 
of ROM are equivalent to one logic gate. However, this 
ratio does not apply to all designs. For example, to make a 
quad full adder (5 outputs, 9 inputs) would require 5 x 29 
or 2560 bits of ROM, but can be realized with less than 40 
gates — for a ratio greater than 64 bits/gate. 


When using ROM to replace wired logic gates, the designer 
should remember that the ROM is not guaranteed to give a 
single output transition for a single input transition. Figure 
2.8 illustrates the way the designer should view the ROM’s 
behavior. In Figure 2.8, after a short hold time, the outputs 
are undefined until a period equal to the ROM’s access time 
has elapsed. During this undefined interval, the ROM 
outputs may show noise or extra transitions. Not all ROM’s 
specify a hold time. Even when a hold time is specified, it is 
valid only when access to a location has been made, and is 
measured from the first address transition. 


B. SEQUENTIAL CIRCUITS 


Sequential circuits are logic circuits with internal storage. 
As a result, outputs are a function of past as well as present 
inputs. Sequential circuits are often realized by a collection 
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Figure 2.8. ROM Behavior for Combinatorial Logic 


of storage elements (flip-flops) together with combinatorial 
logic. Outputs of the sequential network are combinatorial 
functions of the inputs to the network and the flip-flop 
outputs. The inputs to the flip-flops are combinatorial 
functions of network inputs and flip-flop outputs. 


When a sequential digital system is described in the above 
manner, the state of the circuit is determined by the 
contents of the flip-flops. Therefore, a machine with n 
flip-flops can have at most 2” internal states. To describe 
the circuit behavior, two sets of information must be 
known: 


1. The outputs as function of inputs and internal states; and 
2. The next states as functions of inputs and internal states. 


This information may be presented via tables or graphically 
in the form of a state sequence diagram, such as that shown 
in Figure 2.9a. The state sequence diagram is usually drawn 
as a collection of circles, each labelled to correspond to one 
state of the machine. The circles (states) are connected by 
directed lines (arrows) indicating which state transitions may 
take place. Each such transition line is labelled with the 
values of the input variables for which the transition takes 
place, unless the input variables have no effect. In that case, 
the state transition always takes place and the arrow is 
unlabelled. 
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i fe 
Some digital circuits are clocked, i.e., state transitions take 
place only upon occurrence of a clock pulse. If for some 
input conditions no state transition takes place at a clock 
time, it is indicated on the diagram as an arrow which leaves 
and re-enters the same circle. This arrow is labelled, like any 
other, with the corresponding input conditions. Clocked 
sequential circuits are readily designed using clocked 
flip-flops of the JK or D variety such as those shown in 
Figures 2.9b and 2.9c. 


PULSES 


b, Corresponding Sequential 
Circuit — JK Realization 


c. Corresponding Sequential Circuit — 
D Realization 


Figure 2.9. Sequential Circuit State 
Diagram and Realizations 
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B.1. State Assignment 


The state-sequence diagram describes the digital circuit 
behavior independent of the assignment of states to the 
circles of the diagram. Each circle in the diagram must be 
assigned a unique set of values for the state variables. Each 
state variable can take on the value of 1 or 0, so that n state 
variables can provide values for up to 2” circles in the 
diagram. However, the way the values are assigned to the 
circles can make a significant difference in the ease of 
realization when JK or D flip-flops are used. At present, no 
known technique, other than repeated trials, exists for 
determining the minimum cost state assignment. The 
designer’s insight and experience contribute significantly to 
the design efficiency. However, when ROM’s are used, state 
assignments are less critical than for realization with wired 
logic gates. 


B.2. Asynchronous Input to Clocked Systems 


When a clocked system has asynchronous input variables, 
i.e., variables which can change at other than clock times, 
proper behavior may depend upon the state assignment 
used. For example, if the values of a given asynchronous 
input variable can affect the values of two state variables in 
a given state transition, differential delays in the logic may 
allow 4 rather than 2 possible state changes to take place: 
neither, either, or both of the variables may change. To 
avoid this situation, state assignments should be such that 
only one state variable is a function of each asynchronous 
input variable or the asynchronous input variable should be 
made synchronous by clocking it into a flip-flop. Of course, 
the latter procedure increases the response time of the 
system to the input signal. 


These considerations also apply to the asynchronous 
flip-flop forcing inputs. In general, these inputs can force 
the network into one or more. of a subset of the states 
where it will remain until the forcing input is removed. If 
the network clocked transitions attempt to change more 
than one forced state variable, asynchronous removal of the 
forcing signal may result in any of several state transitions: 
any or all of the variables attempting to change may do so, 
depending upon differential delays in flip-flop responses, 
clock distribution, and distribution of the forcing signal. 


B.3. Realizations with D Flip-Flops 


Having assigned state variable values for each state, 
realization with D flip-flops is very straightforward.* First, 
a truth table or set of Karnaugh maps is prepared. The 
source variables include all state variables and all input 
variables. The functions to be generated involve all state 
variables (next state value) and all output functions. Those 
functions representing the next state values are used as the 
data inputs to the corresponding D flip-flops. 


*For sequential networks wired with logic gates, JK 
flip-flops may reduce the gate count as in Figures 2.9b and 
2.9c. However, ROM realizations are more economical 
when D flip-flops are used, because fewer functions need be 
generated. 


intel’ 
Figure 2.10 shows a symbolic diagram of such a network. 
The “clocked register” is an array of n D-type flip-flops. 


A read only memory array with p address inputs and q 
outputs (2P xq bits) can generate a total of q output 
functions of p inputs. Thus for Figure 2.10, if n state 
variables are required, p-n input variables may be used and 
q-n output signals may be generated. 
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Figure 2.10. Realization of Digital Machine 


Because a ROM’s internal realization is quite different than 
that of a conventional combinatorial logic network, 
different considerations apply to ROM designs than for 
conventional designs. For example: 


1. State variable assignment has little or no effect on 
circuit complexity when ROM realization is used. 
Therefore, the designer may use state variables to form 
output functions directly with greater ease than for 
conventional designs. If, however, additional logic 
circuits are added to reduce total ROM requirements or 
allow asynchronous input variables (see next 
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paragraph), some of this design freedom may be 
removed. 

2. All outputs of a ROM must be considered functions of 
all inputs. Therefore, asynchronous inputs to the ROM 
should not be permitted to change within an access 
time prior to clocking the output register, or the 
contents of the output register may be completely 
unpredictable. Additional latches or separate logic 
between the ROM output and D flip-flop inputs should 
be used so that the conditions described above (under 
Asynchronous Inputs to Clocked Systems) can be met. 
Additional ROM outputs may be used to enable or 
disable this logic. 


B.4. Methods of Reducing ROM Size 


If the number of input or output variables is large, a 
straightforward realization with ROM may not be practical. 
However, it may be possible in certain areas to reduce the 
amount of ROM’s by adding a small amount of additional 
logic. Several methods for reducing the size of a ROM 
needed to perform a given function are described below. 
The use of these techniques when appropriate may permit a 
ROM approach to be used in a situation where it would 
normally be impractical to do so. Most of these techniques 
are illustrated in Figure 2.11. | 


Multiplexing Input Variables 


Instead of using all input control variables at all times, ° 
many digital machines have only a few states where the 
next state decision is affected by the input variables. 
Therefore, a multiplexer may be used to select the input 
variables which are active for each given state of the . 
machine. The effective number of input control variables at 
the ROM may be reduced to a number equivalent to the 
largest number active at any one time. 


The control signals for the multiplexer may be generated by 
logic circuits which decode the state 
information or by extra output vari- . 
ables from the ROM. In general, these 


SAHIABLER extra ROM output variables are far less 
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INPUT that the next state is a simple func- 
re tion of the input variables, a small 
amount of logic may be placed be- 
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FUNCTIONS tween the ROM output and the 


clocked register. Some of the input 
control variables are then brought 
into the system via this logic rather 
than through the ROM. As in the 
case with input multiplexing, addi- 
tional output signals may be used to 
enable this logic. 7 


Figure 2.11. ROM Realization of Sequential Machine 
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One simple form of this method uses a multiplexer between 
the ROM output and the clocked register. Certain of the 
state variables take on the values of the input variables 


whenever the multiplexer is set to accept these inputs. This 
method places restrictions on state assignment. 


A similar technique is usually necessary for use with input 
control variables which are asynchronous. 


Output Function Distribution 


When a large number of control functions must be 
generated, but only one or two are active at one time, data 
distributors may be used to generate a large number of 
control functions from a few ROM outputs. As an example 
of the type of coding which might be used, 8 
non-simultaneous control functions might be generated 
using one data bit and 3 selection bits..The Intel 3205 
_ decoder may also find use in ROM output expansion. Eight 
selection signals can be generated from three ROM function 
outputs. 


External State Generators/Partitioning/Factoring 


~ When a large number of states of a state diagram fall in an 
easy to generate sequence, the number of state variables 
generated by the ROM may in some cases be reduced by 
- generating the additional states with external circuits such 
‘as counters or shift registers. Functions of these separately 
realized state variables may be used as equivalent state 
variable inputs to the ROM. 


As an example of this technique, consider a binary counter 
- connected to a ROM such that the ROM can generate a 
preset or count enable variable and accept a carry output as 
equivalent to a state input variable. The ROM may be 
programmed so that for some states of the conventional 
state variables, the counter counts from its preset values 
until it overflows with the ROM staying the same state 
throughout the counting sequence. In this example, one 
input (equivalent state) variable replaces all of the state 
variables in the counter. 


The example above is a special case of a more general 
technique which may be called, partitioning. Instead of 
using an external counter with the ROM system, another 
" ROM/register sequential machine might have been 
connected. The net result is a ROM/register realization of a 
sequential digital machine in which not all state variables 
are used as inputs to all of the ROM. In effect, the 
machine is partitioned into a number of smaller, but 
interactive, machines. 


"To partition a circuit, the state variables must be isolated 

into two or more groups. A new state diagram can be 
generated for each group. In these partitioned state 
diagrams, the state variables for one state diagram are 
treated as if they were input control variables in the other. 
In general, for partitioning to be effective, the state 
variables must be such that they can be divided into 
relatively independent groups. 


These examples are but a few of those available to the 
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designer wishing to take advantage of ROM. As the design 
complexity progresses, the structure approaches the 
complexity of a microprogrammed processor— one 
application where ROM is extensively applied. 


ROM, even in complicated networks like that of Figure 
2.11 or a microprogrammed processor, offers much easier 
modification of machine structure than wired logic. With 
the availability of programmable ROM’s, ROM approaches 
to sequential circuit design merit serious consideration. 


Even when a prototype system has been developed using 
the 3601 or 1702A, once ROM patterns have been fixed, 
the prototypes can be easily converted to use the 3301A or 
1302 for production for these mask programmed devices 
are pin and signal compatible with their field programmable 
counterparts. 


EXAMPLE OF A ROM 
APPLICATION 


To show one simple application of ROM, consider the 
signal generator shown in Figure 2.12. An 8-bit counter 
driven by an oscillator drives a 2048-bit ROM (256 words 
of 8 bits). The ROM outputs are converted to an analog 
voltage by a digital to analog converter (DAC). By properly 
coding the ROM, a wide variety of waveforms may be 
generated. . 


For the system shown in Fig. 2.12, each step of the 8-bit 


360 
counter represents ose of phase angle. The value at 


each address in ROM is the digital number representing the 
signal output for that phase angle. Multiple ROM/DAC 
combinations might be used to generate several 
simulataneous waveforms, or multiple phases of a signal, for 
example. The output of the DAC’s will change in small 
discrete steps, each less than 1% of full scale. Where this 
might be a problem, filtering might be used. However, 
undesired harmonic content of the signal will be limited to 
the upper harmonics. 


ROM ANALOG 
256 WORDS X 8 BITS wasn OUTPUT 


Az 


8 BIT BINARY COUNTER 


CLOCK 
OSCILLATOR 


Figure 2.12. Digitally Controlled Waveform Generator 
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INTRODUCTION 


One of the first semiconductor memory devices to find 
wide application was the MOS shift register. Two properties 
of MOS IC technology are uniquely compatible with the 
design of shift registers: the high impedance associated with 
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Serially Accessed 
Semiconductor 
Memory— _ 
SHIFT REGISTERS — 


the gate circuit permits temporary storage of charge on the - 
parasitic capacitances, and MOS technology permits 
realization of bi-directional transmission gates which have 
zero dc offset. With the transmission gate, a gate-node may 
easily be connected to, or disconnected from, other points 
in the circuit. 


The shift register structure offers layout economies as well. 
Few interconnections are required, basic shift register stages 
may be made using little silicon area, and no decoding or 
other “overhead” circuits need be placed on the chip. These 
features have made the shift register one of the lowest cost 
semiconductor memories on the market. 


Shift register memories are already extensively used in 
computer display terminals, where the data to be displayed 
are recirculated through the shift register memory in 
synchronism with the CRT display. Shift registers are also 
used as the memory of most electronic calculators. By 
providing address counters and a comparator, the shift 
register is often used as a low-speed, random-access 
memory. For example, in line printers and card readers 
where access time is relatively non-critical, the MOS shift 
register may be used as a low-cost buffer memory. To lo- 
cate a datum in a serially accessed memory, a sequence of 
interviewing locations must first be accessed. Magnetic 
tapes, disks, and drums are all serially accessed. 
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PRINCIPLES OF OPERATION 


There are many variations on the basic shift register. A 
typical p-channel MOS shift register circuit, i.e., half of a 
dual 100-bit shift register (Intel 1406) is shown in Figure 
- 3.1. Each bit of the shift register requires six MOS devices. 
Note that the devices of bit 2 have been labeled QoA 
through Qop. The input datum to the stage is the charge on 
the gate of Qoa, deposited by the previous stage. When 
clock ¢9 goes negative (for p-channel devices), devices Qo 
and Qop form an inverter stage. If the charge on the gate of 
Qa is sufficiently negative to cause it to conduct strongly, 
the node common to Qe, Qop, and Qac will approach 
Vcc (a positive level). However, if the charge at the gate of 
Qe is positive enough to leave Qga cut off, when $2 
becomes negative, the common node will approach Vpp, 
the negative supply. 


2H SCHEMATIC FOR SECOND 100 BITS SAME AS ABOVE. 
7 
' 


At the same time, when ¢9 goes negative, Qoc conducts, 
charging the parasitic gate capacitance of Qgp (shown as 
capacitor C in Fig. 3.1) to the same potential as the node 
common to Q9a, Qap, and Qac. When ¢¢9 is removed, the 
gate of Qop retains its potential. Pulsing $1 negative then 
transfers and again inverts the datum depositing it at the 


input to the next stage. 
In the circuit shown, an output device is provided so that 


charge need not be retained on external leads. This output 
buffer also acts as an inverter. To make input and output 
levels compatible, an input inverter is provided. In Fig- 
ure 3.1, the data is transferred to the output during clock 
phase 1. Thus, the data is available during clock phase 2. 
Data at the input must be available prior to and during 
phase 2. 


1 t 


Veo pit100 Vee 


*FOR 1407, 1507 OPTIONS 
Hes: 

Vee = 4 | 2OuT 2 
Vpb =8 | (6) 
$, =5 + 
¢2=3 i 

! 

100 BITS 1 OVeg 


Figure 3.1. MOS Shift Register 
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The previously discussed circuit is a dynamic shift register, 
using only capacitive storage. To retain the data stored in 
the register, the rate at which the data is circulated (or 
clocked) through the register must not fall below some mini- 
mum, typically 1kHz at room temperature. The minimum 
data rate is proportional to the ambient temperature. 


Static shift registers may also be manufactured using MOS 
technology. The static register can retain the data indefinite- 
ly if stopped at the proper point in the clock cycle. How- 
ever, static shift register cells are significantly larger than 
the dynamic cells and consume much more power. For 
these reasons, not only are static registers more expensive, 
but static registers are usually slower than dynamic registers. 


The earliest MOS shift registers used high-voltage, metal 
gate technology, and required 15 to 20 volt power supplies 
and 30 volt clocks. However, with the introduction by Intel 
of p-channel silicon-gate technology in 1969, shift registers 
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could be operated from 10 to 14 volt supplies (+5, —5, or 
+5, —9) and with clock swings of about 15 volts. Further- 
more, by biasing the positive power terminal at +5 volts, 
TTL compatibility on data input and output become 


achievable. 


The compatibility of MOS shift register with TTL logic was 
further enhanced by the development of a 5 volt, n-channel, 
silicon gate MOS process. The Intel 2401 (dual 1024 bits) 
and 2405 (single 1024 bits) are n-channel, dynamic recircu- 
lating shift registers with a data rate of 1MHz at 0°C to 
70°C. Only a single 5V supply is required for operation. 
The input, output, and clock level is completely TTL com- 
patible. The 2401 and 2405 require only a single TTL 
level clock. Power dissipation is typically 120uW/bit. A - 
photomicrograph of the 2401 is shown in Figure 3.2. 


Table 3.1 shows some of the varieties of dynamic MOS shift 
registers which are currently available. 


NX 


Table 3.1. Shift Registers 


Part Number Technology Organization 
256 x 4 

512 x 2 

1024 x1 


512 x 1 recire. 


p-channel 
p-channel 
p-channel 
p-channel 


100 x 2 
100 x 2 
100 x2 
100 x 2 


p-channel 
p-channel 
p-channel 
p-channel 
n-channel 


n-channel 1024 x 1 recire: 


NOTES: 


[1] 


1024 x 2 recire |} 
{1] 


1. 1405A, 2401 and 2405 have recirculation logic on the chip. 
2. The 1402A, 1403A, 1404A, and 1405A may also be operated at +5V, —9V. Refer to the Intel Data Catalog for specifications, 


TTL Comp 
Data Clocks 


Data Rate 


Power suppres 
(Max/Min) } 


(Volts)! 


5MHz/10kHz 
5MHz/10kHz 
5MHz/10kHz 
2MHz/10kHz 


2MHz/10kHz 
2MHz/10kHz 
2MHz/5kHz 
2MHz/5kHz 
1MHz/25kHz 
1MHz/25kHz 


3-3 


intel’ | Shift Registers 


CIRCUIT CONSIDERATIONS 


A. P-CHANNEL MOS TO TTL INTERFACE 


All of the p-channel MOS devices listed in Table 3.1 are 
TTL compatible on data input and output by means of at 
most a single added resistor per input/output pin. 


With the shift register biased so that the Vqq pin is +5 V 
and the Vpp pin is at —5 V, a TTL gate (biased between 
ground and Vc) can drive the data input pin of the shift 
register. To meet rated signal levels, a pull up resistor 
should be used for logic gates with active pull-ups. A suit- 
able resistor on the output terminal allows a single TTL 
load to be driven. The connections are shown in Figure 3.3. 
Figure 3.3 also shows an interface from the output of one 
register into another, along with a table of resistor values 
which apply to the different interfaces. The 1407 and 1507 
have an internal pull down resistor of approximately 20K. 
As a result, these devices do not require an MOS to MOS 
interface resistor. 


For the devices in Fig. 3.3 operating at +5V and —SV, the 
clock amplitude is between +5 and some negative voltage, 


IN 


DTL/TTL 


9, = 


NOTE: For the 2401/2405, an external resistor to VDD 
is not required. An external output resistor to Vcc may 
be required depending whether the user connects the 
device's internal output resistor, 


in the range of —9 to—12V. The positive swing should ap- 
proach Vcc with a tolerance of +0,3V, —1.0V, so that no 
substrate current flows and no transmission gate remains 
conductive during the opposite clock pulse. Exceeding 
either limit can reduce the ability of the gate nodes to store 
charge. The effects may prevent operation or, at best, 
raise the minimum operating frequency. 


While some variation in power supply and clock amplitudes 
may be tolerated, deviations of sufficient magnitude may 
reduce the range of frequency of operation from both ends. 
Excessive supplies may result in problems due to field 
inversion, i.e., creation of conducting channels under leads 
carrying the excessive voltage. These channels correspond 
to undersired connections between points of the circuit. 
Low supply voltage may lower the maximum and raise the 
minimum operating frequency. 


B. CLOCK DRIVER REQUIREMENTS 


Clock driver circuits must be capable of producing the 
proper clock voltage levels and timing. These circuits must 


DTL/TTL 


1402A/3A/4A 1402A/3A/4A 1406/1506 1407/1507 
Veco =5V Vec =5V Veco =5V Veco =5V 
Vpp =—5V Vpp =—3V Vpp =—5V Vpp =—5V 


3.0K 


20K 


Figure 3.3. Shift Register to TTL Interface 
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also be capable of driving the significant load represented 
by the clock input terminals. For the devices listed in 
Table 3.1, each clock terminal presents a load of approx- 
imately 0.1 to 0.2 pF per bit. 


Thus, for a system of 10,000 bits, a capacitance of 1,000 to 
2,000 pF can be expected. Large transient currents flow in 
clock driver circuits. A 15 V swing in 150 ns requires 
0.1 mA average current per picofarad. As a result, clock 
drivers capable of driving a 10,000-bit memory with 150 ns 
rise and fall times must provide 200 mA drive currents. Rise 
and fall times this long would limit operation to below 
1.2 MHz. 


The clock drivers must not only be capable of driving the 
capacitive load, but must do so without excessive ringing, 
because ringing voltages may violate the restrictions men- 
tioned in Section A. In addition, when at the positive ex- 
treme (off condition), the clock driver should present a low 
impedance. As all shift registers exhibit some clock to clock 
coupling capacitance, drivers with insufficient damping 
capability (too high an impedance in the high state) may 
allow positive or negative spikes to be coupled from one 
clock to the opposite phase. Such coupled spikes, like any 
other noise, can degrade performance. 


TTL GATE 


SEE TEXT 


b. Clock Driver with Non-Saturating Output Stage 
Figure 3.4, Shift Register Clock Driver Circuits 


Figure 3.4 shows two clock driver circuits which may be 
used to drive shift registers. 


The typical waveform produced by a clock driver operation 
in a shift register system is similar to the idealized 
waveform shown in Figure 3.5. 


Table 3.2 lists some of the characteristics of the clock 
driver waveforms as produced by the clock drivers of 
Figure 3.4. The parameters in the table are labelled on 
Figure 3.5. The measurements of Table 3.2 were taken 
using a test load. 


The test load specified in Table 3.2 is as shown in 
Figure 3.6. This test load models load and coupling 
capacitance for a typical shift register. The two capacitance 
values in the table correspond to C, and Co, respectively. 


The values in Table 3.2 are derived by applying a pulse train 
to input 1 while holding driver 2 in the high output state by 
suitable signal on input 2. Output rise and fall times are 
measured at test point 1 while coupled signals are measured 
at test point 2. 


In the driver of Fig. 3.4a, resistors R,, Ro, and Rg control 
the damping and limit charging current. The diode serves to 
apply some negative bias to the base of Qg when the driver 
is in the high state. This bias improves the “ability of Q3 to 


damp positive transients. 


The driver of Fig.3.4b uses a transistor operated in a 
non-saturating mode (Q, ) to provide low output impedance © 
in the high state. The output impedance will be the sum of | 
resistance Ro and the output impedance at the collector of 
Q,. Positive pulse damping is limited by the current 


COUPLING DUE TO 
OPPOSITE PHASE COUPLED 


HIGH LEVEL MAXIMUM 


CLOCK WAVEFORM OUTPUT 


TURN OFF DELAY 


LOGIC INPUT 


Figure 3.5. Clock Driver Waveforms 


TEST POINT bee POINT 
#1 


DRIVER as DHIVER 
#1 


Figure 3.6. Clock Driver Test Load 
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through R,. Rg limits the negative charging current to 
avoid excessively negative coupled pulses. Because of the 
limited positive clamping capability, loads in excess of 
2,000 pF are not recommended unless R, is lowered. 
However, a lower value of R,; may require replacing the 
TTL gate with a TTL buffer/driver gate. 

For very small loads, e.g., C; = 200 pF, saturation effects in 
transistor Q3 may become significant. Adding capacitor C 


and diodes Dg and D, to the circuit of Fig. 3.4b makes Q3 
a non-saturating transistor and speeds turn off time. (The 
line from the collector of Qo to the base of Qs must be 
opened at point x in Fig. 3.4b). When capacitive coupling 
like that of capacitor C is used, care must be taken that 
noise on the Vpp supply does not cause Qs to conduct and 
produce stray negative clock pulses. 


Table 3.2 Clock Driver Characteristics 


Driver/Load Delay to | Fall Time | Delay to 


Circuit a, R; =0 
10,000 pF/1100 pF 


Rise Time | Coupled | Coupled | High Level | Low Level 


Min. 


Circuit a, R7 =100 
~ 1000 pF/110 pF 


Voc —.95 


Voc —.95 


Circuit a, R;= 100 
200 pF /22 pF 


Vcc —.8 | Voc — 5 | VDD 


Circuit b, 
8000 pF /220 pF 


Circuit b, 
1000 pF/110 pF 


Circuit b, 

200 pF /22 pF 

C, D3, D4 
added (see text) 
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POWER CONSIDERATION IN 
SHIFT REGISTER MEMORIES 


Except for the 2401 and 2405, all of the shift registers 
listed in. Table 3.1 exhibit a power dissipation which is 
directly proportional to clock duty cycle. As a result, 
suitably designed shift register memories can retain data at 
very low power levels. These shift registers draw power 
supply current only when one of the clocks is active. As a 
result, lowest operating power is achieved at the lowest 
operating frequency when the clock is operated at its rated 


RAMs AS SERIAL MEMORY 


Random-access memories may be made to appear serial by 
providing a counter to generate addresses for the RAM. 
Using a read/write cycle allows the contents of the selected 
location to be read, after which new data may be entered. 
A divide by N counter makes a RAM of N or more words 
appear much like an N-bit recirculating shift register. 


Although a RAM may be used in this manner, certain 
features of true shift register will be absent. For example, 
because data does not actually move, editing by switching 
extra stages in or out is not available. Similarly, while a 
“tap” between sections of a shift register does not slow its 
operation, simulating the function of a tap when using the 
RAM simulation requires an additional access to memory at 


minimum width. For example, the 1402A/1403A/1404A 
family of shift registers can typically be operated at room 
temperature with dissipations of less than 0.2uW per bit. At 
such low power levels, the dissipation of clock drivers, 
address control and interface circuits becomes significant. 
As a result, special switching of power to clock drivers and 
interface pull down resistors may be necessary to achieve 
minimum power data retention. 


an address which must be computed by adding the tap 
displacement (from the end of the shift register) to the 
value in the address counter. Nevertheless, some CRT 
display terminals use RAMs for buffer storage, operating 
them in a mode similar to that of a shift register. The RAM 
used in this manner may be accessed in normal random- 
access mode at those times when it is not being “‘shifted”’ 
e.g., during CRT retrace. A static RAM used in this manner 
need not have a minimum shift rate. However, dynamic 
RAMs with refresh requirements (such as the 1103) may 
meet those requirements by using this pseudo-shift register 
organization with an appropriately chosen minimum shift 
rate, 
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APPLICATION OF SHIFT REGISTERS 


Although a shift register memory behaves much like an 
acoustic delay line or magnetic drum memory, unlike these 
memories, the shift register can be “stopped” for some 
maximum period and then instantaneously restarted.. As a 
-result, it is much more convenient to synchronize a shift 
register memory to other circuits than it is to synchronize a 
drum or delay line. In addition, shift registers waste less 
storage than the other techniques when used in certain ap- 
plications which do not have continuous data flow. 


One such application is the refresh memory for a cathode 
ray tube (CRT) computer output terminal. These terminals 
are used to display alphanumeric computer data and usually 
consist of an input keyboard, CRT with buffer memory, 
and a relatively low data rate interface to a (remote) 
computer. 


Figure 3.7 shows a block diagram of the display portion of 
such a terminal. The serial memory is usually realized with 
MOS dynamic shift registers and stores several hundred to a 
few thousand characters. To minimize the amount of 
memory, the characters are usually stored in a compact 
code of six to eight bits each. The display on the face of the 
CRT is usually a 5x7 or a 7x9 dot matrix, so that to 
display the character, the code must be converted from the 
compact code to the 35- or 63-bit display code. This 
conversion is performed by a read-only memory (ROM). 


(RASTER) 


CLOCK, DEFLECTION 
AND POSITION 
COUNTERS | 


CURSORS 


SERIAL 
DATA IN MEMORY 


Figure 3.7. CRT Display Terminal 
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Several types of CRT raster scans may be used with this 
type of display. If a standard video scan is used, the serial 
memory must present the same data for several lines in 
succession, i.e., for each line on which part of that row of 
characters appears. Figure 3.8 shows one way to organize 
the memory: so that no storage is wasted. One register is 
used as a recirculating register as well as a continuation of 
the main memory. In normal mode, control signal Y is true, 
X is false, and both clock sets are operated in synchronism. 
When data to be recirculated have been loaded into the 
recirculation register, Y is made false, X made true, and the 
clocks to main memory are then recirculated as many times 
as necessary by applying a sufficient number of clock pulses 
($1R and $9p). Note that the length of the recirculating 
register need not be exactly the same as the number of 
characters to be displayed — the shift register can be longer 
if there is sufficient time during retrace to shift the 
unwanted data past the output port. 


SHIFT REGISTER 
CHAIN (MAIN 
MEMORY) 


Figure 3.8. Serial Memory with Recirculating Loop 
(1-bit path shown) 


Editing of data in the shift register memory can be 
accomplished by providing an extra stage or two which can 
be switched in and out— effectively lengthening or 
shortening the shift register. Thus, when a bit must be 
inserted between two other bits, the data in the register are 
shifted until the two bits between which the new data must 
be inserted are adjacent to the extra stage. The data to be 
added are placed in the extra stage and the connections 
changed to include the extra stage in the shift register 


In 
chain. To maintain the length of the shift register at a 
constant value, some other character must be deleted. By 
clocking the data until the character to be deleted is in the 
extra stage, and then disconnecting the extra stage, the 
deletion can be effected. Fig. 3.9 shows one circuit which 
may be used to provide this feature. When A is false and B 
and X are true, the extra stage, realized by a D flip-flop, is 
included. When A is true and B is false, the stage is not 
included. Inputs Y and D are for loading the D flip-flop 
with data from an external source. The use of AND-OR- 
INVERT gates results in the data in the flip-flop being 
inverted, so the Q output is used. In Figure 3.9, the extra 
flip-flop stage is shown being clocked with clock phase $4. 
This signal (supplied at the proper TTL logic level) would 
be correct for the 1406 and 1407 shift register. However, 
when using the 1402A, 1403A, or 1404A shift registers, 
which are internally multiplexed, the signal to the clock 
input of the D flip-flop must be equivalent to ¢, and $9 
OR’ed together. In general, the D flip-flop must be clocked 
once for each data item transfer within the shift register. 


o, $92 


0 SHIFT 
REGISTER 


Figure 3.9. Circuit for Adding and Deleting Data in 
Shift Register Memory 


Another common organization used for CRT display 
terminals utilizes a “dither” scan or short range vertical 
scan super-imposed on a slower and coarser raster than that 
described above. The vertical dither scan is chosen to be 
one character in height, with the coarse raster having one 
horizontal trace for each line being displayed. Keeping the 
vertical scan rate the same as that used for conventional 
raster scans results in a much slower horizontal line rate. In 
addition, because each character is fully displayed after being 
fetched from the shift register memory, the number of 
accesses to the memory and hence the data rafe of the 
memory is reduced. As a result, the lower shift rate reduces 
the power dissipation of the shift register, and the single 
access per display eliminates the need for the one line 
recirculating buffer as shown in Figure 3.8. 


A. SHIFT REGISTERS FOR 
LARGER MEMORIES 


The MOS shift register offers a low-cost, high-performance 
memory that falls, both in access time and price, between 
the drum memory and the main frame ferrite-core or 
semiconductor memory. The shift register may be used to 
realize file memories with performance in this intermediate 
range. While most drums show access times in excess of 


Shift Registers | 


several milliseconds, and mainframe computer memory 


access time is usually under 1 ys, shift register memories are 
conveniently designed with access times in the order of 50 — 


_ to 500 pus. 


Shift register access time may be reduced below these 
figures in several ways not available with drum systems. For 
example, a typical shift register may be stopped for periods 
up to 1 ms without loss of data. If the requirement for a 
given access to the shift register memory can be predicted. 
in advance, it may be possible to bring the shift register to 
the proper location before the data are needed. To fully 
utilize this capability, it is necessary to predict the need for 
the access at least a worst case access time in advance of 
when the access is to be made. However, if the prediction | 
occurs too early, the shift register may have to be clocked 
past the desired location to retain the integrity of stored 
data. Of course, the data must then be fully circulated to 
again reach the desired location. However, if it is not 
possible to predict the next access within 1 ms of when 
needed (a typical maximum stopping time), the shift 
register might be stopped several addresses before the 
desired location. As an example, consider a 2 MHz shift 
register, 1024 bits long, with 1 ms hold time. Suppose that 
subsequent accesses to the shift register memory can be 
predicted as occurring some time between 0.5 and 10 ms in 
the future. Normal worst case access would be 0.5 ms 
without prediction. By stopping 10 addresses in advance of — 
the desired location, worst case access time is reduced to 
5 us, and yet the desired location can remain available 
within 5 us for at least 10 ms. 


When the capacity of one shift register is insufficient to 
store a given string of data bits, several shift registers may 
be placed in series as shown in Fig. 3.8. The series cascade, 
while the least expensive organization, increases the time to 
access data in the memory. : 


An alternate structure is shown in Fig. 3.10. This circuit . 
permits the outputs of several shift registers to be gated 
selectively onto the data bus. The select-and-read and 
select-and-write signals may be derived by decoding address 


SEL 2& W 


ss 


WR, DATA IN 


Figure 3.10. Combination Serial and Random Access 
Memory 
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bits in addition to those used for serial memory address. By 
using floating collector gates on the register outputs to 
drive the data bus, the wired OR connection may be used. 
Several of the shift registers listed in Table 3.1 include 
recirculation logic on the chip which permits interconnec- 
tions equivalent to Fig. 3.10 without additional logic. The 
1405A, 2401 and 2405 include this logic. 


When shift registers are used as file memories, it may be 
desirable to add some local processing. As was shown in 
Fig. 3.9, some editing operations may be implemented by 
switchable register stages. Additional logic may be added 
within the shift register “loop”’ to perform such operations 
as content searches, etc. 


B. ADDRESSING SHIFT REGISTERS 


The data in a shift register memory moves within the 


memory. As a result, some form of control circuit must be 


provided to establish which data (equivalent address) is 
available at the shift register terminal. 


One form of control utilizes a counter which cycles once 

for each circulation of the data in the shift register. That is, 

a divide by N counter would be used for an N-bit long shift 

register. (NOTE: N must correspond to the full length of 

the recirculation path. If a few additional stages of register 

are included in the recirculation logic, these stages must be 
included in the length count). 


Data in such a shift register memory can be “randomly” 
accessed by combining the control counter with an address 
comparator, as shown in Fig. 3.11. When an address is 
applied at “address in” in Fig. 3.11 and if the comparator 
does not indicate a match, the shift register and divide by N 
counter are advanced until a match is achieved. Worst case 
access time is equal to N times the minimum clock interval, 
while average access time is half this value. 


In some cases, a length other than that provided by the 
standard components of Table 3.1 may be desired. While a 
custom chip may be designed for the application, it may be 
possible to use a double clocking scheme to make a given 
length register appear to be a few stages shorter. To utilize 
this technique requires that the normal operating frequency 
of the shift register be less than half of the maximum rate 
specified for the device used. 


The double clocking scheme works as follows: Let N be the 
actual length of the shift register and M be the desired 


length, where ; <M<N. 


Then using a structure similar to that of Fig. 3.11, let the 
address counter divide by M rather than N. Provide a logic 
circuit which generates a function of the states of the 
address counter which is true for exactly N-M of the 
counter states. When this logic function is true, clock the 
shift register twice rather than once. As a result the 
corresponding input data item is entered into two con- 
secutive locations of the shift register. On output these two 
locations are shifted out in a single data interval, thus 
appearing as a single datum. 
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Figure 3.11. Random Access Control for Shift Register 


An alternate method for addressing data in a shift register 
memory utilizes a form of content addressing. Certain word 
states may be reserved as markers. For example, if a 4-bit 
wide register is to be used for binary-coded decimal (BCD) 
data, only the codes for ¢ through 9 normally appear. Some 
of the remaining codes, 10 through 15, may be used as 
markers to signify the starting point of each data string. To 
locate a given data item, the shift register is circulated until 
the proper marker is located. The desired data is usually 
placed in the locations immediately following the marker. 
In systems with very wide words, a number of separate 
marker tracks may be provided. 


For minimum power systems, for example, where batteries 
are used to maintain data in a memory in an otherwise 
non-operating environment, the power associated with 
address control may be significant. If the retained data is to 
be useful, when the system is restored to operation, the 
correlation of address counter to shift register data location 
must also be restored. One approach is to use a very low 
power technology, such as CMOS, for the address counter, 
and retain the correlation throughout the low power 
condition. An alternate technique, similar to the second 
addressing mode described above, may be used when 
certain memory word-states are not used as data items. One 
of these unused states may be used to mark a given address, 
perhaps address zero or address N-1. (Of course, this 
address is no longer available for data storage). After 
normal power is restored to the memory, but before 
operating it in the normal mode, the memory is circulated 
until the special marker code is found and the address 
counter is then reset to the corresponding value. In this 
way, the address count need not be retained during low 
power operation. However, the marker address and code 
must be reserved, and extra logic for writing the marker 
initially and finding the marker and resetting the address 
initially must be provided. 
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INTRODUCTION 


The first three parts of this booklet have discussed the most 
fundamental and common types of semiconductor 
memory. However, a number of other semiconductor 
memory organizations and structures are possible. Three 
memory or memory related structures are briefly discussed 
in this section. 


CONTENT 
ADDRESSABLE 
MEMORY (CAM) 


All of the random-access memories previously discussed in 
this booklet used a fixed address structure. At the time of 
storage, data is assigned an address in memory. To retrieve 
the data, that address must be supplied. Content address- 
able memories retrieve data based on content rather than 
address. When data are entered, they may be assigned to the 
first available locations. For such data to be retrievable, 
they must contain “keys” or identifiers to aid in their 
location. 


One realization of content addressable memory utilizes a 
memory cell which combines a basic random-access 
memory cell and a comparator. A Schottky technology 
bipolar realization of such a cell is shown in Fig. 4.1. 


CAMs, Buffers, Multiports 


Other Memory 
structures—CAMs, | 
Buffer Memories, and 
Multiport Memories 


In Fig. 4.1, transistors Q; and Qo form a memory flip-flop 
while transistors Q3 and Q, and the four Schottky diodes 
form a comparator. Data are entered into the cell in the 
conventional way using the row-select data and write enable 
lines. However, cell data may be non-destructively com- 
pared with data placed on the data lines. The match line 
will be drawn positive by any mismatch between a 
connected cell and data line data. If both DATA and DATA 
lines are held low, the contents of the cell will not influence 
the signal on the match line. 


CAM cells are arranged in a two dimensional array to form 
a memory. To expand such a memory, row select and 
match lines must be extended in the horizontal direction 
and data lines must be extended in the vertical direction. 
One of the problems associated with the expansion of a 


WRITE ENABLE 


ROW SELECT 


Figure 4.1. Content Addressable Memory Cell 
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CAM is associated with the connection of match lines. Each - The Intel 3104 is a 4x 4 bipolar CAM using the cell of 
match line must be individually connected to all cells in its Fig. 4.1. Figure 4.2 shows the package pin connections and 
row. If a row must extend over more than one memory a logic diagram of the 3104. As can be seen from Fig. 4.2b, 
chip, its match line must be made common to all chips each data input line has a corresponding enable line. 
contributing to the row. As a result, a much larger number Turning off the enable causes both internal data lines, 
of connections are required in a given size CAM than in the DATA and DATA in Fig. 4.1, to be held low so that the 
same size RAM. column of cells cannot contribute to match. 
a. Pin Configuration WRITE ENABLE WE(? 24 15 Vcc SUPPLY VOLTAGE 


“DATA INPUT 6,42 23 FE, BIT ENABLE INPUT 
DATAINPUT 0D, (1) 3 - 22 CE, BIT ENABLE INPUT 
DATAINPUT 0,1] 4 21 E, BIT ENABLE INPUT 
DATAINPUT 0, (795 20 E, BIT ENABLE INPUT 

MATCH OUTPUT M,C] 6 1917) A, ADDRESS INPUT 
MATCH OUTPUT M, (J 7 18 [7] A, ADDRESS INPUT 
MATCH OUTPUT M,(}8 17 FA, ADDRESS INPUT 
MATCH OUTPUT ™, (9 16[F)A, ADDRESS INPUT 
MATCH OUTPUT = M, [-} 10 15[7]6, DATA OUTPUT 


“DATA OUTPUT 6,(711 14170, DATA OUTPUT 
GROUND GRD [J 12 13{0)6, DATA OUTPUT 


*DATA IN and DATA OUT are of the same logic levels. For a chip that is not selected, the data output is 
at a high level. 


b. Logic Diagram 
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Figure 4.2. Intel 3104 CAM 
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Figure 4.3, CAM Organization 


To use a CAM, each word entered into the memory 
BINARY 


contains a “key (search information)” as well as a data ADDRESS SEARCH 
portion. In some systems, what is considered a data ne Oe, sea gle 
portion at one time may at a later time be used for | | 


searching. Figure 4.3 shows how an array of 3104s 
may be organized in an 8-word memory with a 4-bit key 


and a 4-bit data portion to each word. DECODER 


PRIORITY 
ENCODER 


(74148) 


Some organizations may permit multiple matches when 
searching a CAM. For example, the search may be 
equivalent to the question, ““How many entries do I have 
which used X as the key?” The organization of Fig. 4.3 
would result in simultaneous selection of several data words 
if several such entries were found. To resolve multiple finds, 
a priority encoder may be used. The encoder selects only Figure 4.4. CAM/RAM Combination 
one of the several addresses. The addresses selected by the for Multiple Entry CAM 

search may be polled in sequence to determine the various 
data items. Some IC priority encoders give binary addresses 
as output. To use these with the 3104 would require a 
decoder to generate the linear select addresses for the 3104. 


DATA OUT 


Content addressable memory function can also be realized, 


; ie : with much longer effective access times, by actually 
If the data portion does not participate in searches, a searching the memory. For example, a shift register 


3101/3101A RAM might be used as in Figure 4.4. memory may be designed to include the comparison 


Small fast CAMs find application in buffer memory control function as part of the recirculation logic. The data are 
(see below) and certain types of parallel processors. Possible completely searched in one circulation of the memory. 
applications are found in pipelined processors and the 
implementation of some functions, such as sorts, searches, 
etc. 
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BUFFER MEMORIES 


A small, fast random-access memory (buffer) may be used 
to store images of a number of blocks of data selected from 
a much larger, but slower, memory. During execution of a 
program, a processor using a buffer needs to access the 
main memory only when data cannot be found in the 
buffer. When a buffer control strategy can be found which 
reduces the number of main memory accesses to a small 
fraction of the total number of accesses, the system may 
operate as if the large memory had an access time 
approaching that of the buffer. (In general, faster memories 
cost more per bit than slow ones — the buffer effectively 
reduces the cost of a large, fast memory). A block diagram 
of a buffered memory system is shown in Fig. 4.5. 


IMAGE OF DATA BLOCK DATA BLOCK IN 
IN BUFFER MAIN STORAGE 


PROCESSOR W, 


BUFFER 


MAIN MEMORY 


BUFFER 
CONTROL 


Figure 4,5. Buffered Memory System 


Considerable literature has been developed concerning 
_ design of buffer memories, control algorithms, and their 
performance vs. size. Since it is not possible within the 
scope of this booklet to include details of buffer design, the 
interested reader is referred to the literature.* 


A number of the fast, bipolar memories mentioned in Part I 
may be used for the buffer memory. The 3101, 31014, 
3106, and 3107 may be used for the fast, random-access 
memory. Depending on the type of buffer control strategy 
used, the 3104 CAM may be useful for storing buffer 
control tables. These tables are used to identify which data 
blocks from the main memory reside in the buffer, and 
identify the location in the buffer of those that are present. 
The CAM, or an associated RAM, may also be used to store 
information about the degree of utilization of each block in 
the buffer and to indicate whether the contents of a block 
have been altered by program execution. When the buffer 
is full, a new block from main memory will have to replace 
a block in the buffer. The block to be replaced is often 
chosen based on the time of last access or frequency of 
access. If a replaced block has been altered, the data must 
be rewritten to main memory. Some designs write back to 
~ main memory whenever alteration of a location takes place, 
while others write only when the block is replaced. 


When semiconductor memory is used for the main store, 
the system may be designed to take advantage of the 
non-destructive read and simplified output buffering. For 
example, the main memory may be organized in small, 
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interleaved modules with the data being transferred to the 
buffer by a polling technique. 


*R.L. Mattson, et. al. ‘‘Storage Hierarchy Designs”, IEEE 
COMPCON 1972 Digest, pg. 147. 


MULTIPORT MEMORIES 


Large computer systems may include several processors, e.g., 
one or more central processors, several data channels, etc., 
all communicating with a very large memory. If the large 
memory is divided into a number of modules, each having 
several independent access paths, or “‘ports”, one processor 
may access one memory module at the same time that 
another processor is accessing another module. In this way, 
the effective bandwidth of the memory is increased. The 
resultant structure, as diagrammed in Fig. 4.6, somewhat 
resembles a cross point switch array. 


Each memory module in Fig. 4.6 watches the various 
processor busses. When requests for data within the module 
are received, the memory switches itself onto the highest 
priority requesting bus. As a result, processor No. 1 may 
access memory No.2 even while processor No.3 is 
accessing memory No. 1. Although interference is possible, 
and some delay is introduced by the extra logic, overall 
bandwidth is increased. 


Another multiport structure is used in fast, scratch pad 
memories where it may be desirable to fetch two operands 
at once. Although true that two or more port memories 
may be designed with semiconductor technology, they are 
often significantly more expensive to produce than a single 


PROCESSOR 
NO. 1 


PROCESSOR 
NO. 2 


PROCESSOR 


NO. 3 


MEMORY MEMORY MEMORY 
MODULE MODULE MODULE 
NO. 1 NO. 2 NO. 3 


Figure 4.6. Multiport/Multimodule Memory System 


port memory. In some cases where two port access is 
needed, it may be desirable to duplicate the scratch pad 
memory. When reading from the memory, one datum is 
taken from each scratch pad. However, when writing to the 
scratch pad, identical data is written into both scratch pads. 
Thus, the two scratch pad machine appears as a two port 
memory for reading, but as a single port memory when 
writing. 


intel’ 


Part 5 


All of the memory products described use binary address- 
ing. For example, the 1103 provides 1024 address locations 
selected by 10 binary address lines. However, some 
machines are more conveniently organized to use binary 
coded decimal (BCD) addressing. For example, 1000 
address locations may be desired, selectable by 12 BCD 
leads (Lead labelling might be U1, U2, U4, U8, T1, T2, T4, 
T8, H1, H2, H4, H8— for units digit bit 1, units digit bit 
2,...., tens digit ...., hundreds digit bit 8). A BCD to 
binary encoder could be used to perform a translation from 
the 12 line BCD to a 10 line binary code so that a standard 


1103 could be used for the memory. However, full 


translation is cumbersome. It should be noted that any 
one-to-one mapping of BCD addresses to binary addresses 
will serve this purpose. Thus, a true BCD to binary 
conversion need not be performed. 


To generate a mapping, the designer should note that 
certain BCD codes cannot appear. For example, line U8 
cannot be true when U4 or U2 is true. However, given any 
state for the 9 variables U8, U4, U2, T8, T4, T2, H8, H4, 
H2, all possible combinations of U1, T1, and H1 may 
appear. Thus, U1, T1, and H1 are all effectively binary 


0 0 0 U2 U4 T2 T4 H2 H4 0O 
0 0 1 U2 U4 T2 T4 O 0 I 
0 1 0 U2 U4 H2 H4 0 1 1 
0 1 1 U2 U4 0 0 1 1 1 
1 0 0 H2 H4 T2 T4 1 0 1 
1 0 1 T2 T4 0O 1 1 1 1 
1 1 0 H2 H4 1 0 1 1 1 
1 1 | F222 TAF <1 1 1 1 1 


BCD Addressing 


BCD Addressing 


variables. If the variables U8, T8, and H8 are all zero, all the 
remaining variables are effectively binary. Thus, if the 
binary address leads are designated Bo, By—Bg, a suitable 
mapping can be derived as shown in Table 5.1. 


Table 5.1. . 
Generation of 10-Line Address from 12-Line BCD 


BO = Ul 
Bl = T1 
B2 = Hil 


*These locations are ‘“‘don’t cares’? — T2 and T4 have been 
chosen for easier implementation. 
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' Table 5.1 is read as follows: Consider the second line of the 
tabular part where the code 001 appears on the left and the 
series of variables U2, U4—0,0,1 appear in the columns for 
Bg, By,—Bg. This line implies that if U8 = 0, T8 = 0, and 
H8 = 1, then the output variable Bg should be equivalent to 
the input U2, the output B, should be equivalent to the 
input U4, etc., with output Bo being held at 1. Table 5.1 
may be filled out in a number of different ways. The 
following rules must apply: 


1. For each code on the left, all variables which are 
effectively binary must appear on the right. Thus, 
except where the 8-bit of the digit is one, the 
corresponding 4- and 2-bit variables must appear on the 


DUAL 

4 INPUT 
1 OUTPUT 

MULTI- 

PLEXER 


8 INPUT 
1 OUTPUT 
MULTI- 
PLEXER 


8 INPUT 
1 OUTPUT 
MULTI- 
PLEXER 


BCD Addressing 


right. This organization insures that all possible input 
states on the left produce unique output states on 
the right. 


2. Each row on the right must not overlap any other row. 
In other words, when any two rows on the right are 
exclusive NORed together, at least one column must be 
zero for all possible variable values. 


The logic to implement the function of Table 5.1 is shown 
in Fig. 5.1. A fast ROM with at least 512 words of at least 
7-bits each could also have been used to realize these 
functions. As shown about 6 packages of TTL, SSI and MSI 
are required. 


B3 = U2- U8 +H2- U8- H8+7T2- U8- HB 


B4 = U4- U8 + H4- U8 - H8+ T4- U8S- HB 


B5=T2- U8- T8+7T2-7T8-HB+H2- T8- H8+ UB: TB 


86=14-U8-T8+7T4-T8-H8+H4- U8- T8- HB+ U8- TB 


B7 = H2- T8- H8+ U8+T8- 


B9 = U8 + T8 + H8 


B8 = H4- U8- H8+T8+U8- 


Figure 5.1. Realization of BCD Modified Binary Address 
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1. Grow oxide 
Cut diffusion hole 
Diffuse p dopant 
Strip off oxide 
Regrow oxide 
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2. Etch gate area 
Grow gate oxide 


3. Etch contact holes 
Deposit aluminum 


4. Etch aluminum 


a, p-channel Aluminum Gate 


All Intel p-channel and n-channel components are realized 
with silicon gate MOS technology. This semiconductor 
technology, first offered commercially by Intel in 1969, 
offers many advantages over aluminum gate technology. In 
Fig. A-1, the steps in the manufacture of conventional MOS 
(aluminum gate) technology are contrasted with those for 
p-channel silicon gate technology. 


Each step in Fig. A-1 represents one photolithographic 
operation (one Masking step). Listed below each drawing 
are the steps to bring the circuit to that phase of its 
manufacture. 


n 


1. Grow oxide 
Cut transistor and diffusion windows 
Grow gate oxide 
Deposit silicon 


2. Etch silicon and gate oxide 
Diffuse p dopant 
Deposit glass overall 


3. Etch contact holes 
Deposit aluminum 


YM 


4. Etch aluminum 


b. p-channel Silicon Gate _ 


Figure A-1. MOS Processes 
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Some of the advantages of using silicon gate processing are 
made obvious by Fig. A-1: 


1. Gate oxide is protected immediately in the silicon gate 
process, while in conventional processing the gate oxide 
remains exposed throughout one entire masking step. 

2. The self aligned gate permits a smaller device with less 
gate to drain capacitance than is possible in conven- 
tional technology. Faster, more compact circuits are 
made possible. 

3. The silicon layer can be used for interconnections, 
permitting reduced chip area per function. In many LSI 
circuits, interconnection area affects cost even more 
than active component area. 


Some other, less obvious, advantages include improved 
reliability due to the number of protecting layers above the 
gate oxide, and lowered threshold voltage due to the use of 
silicon rather than aluminum as the gate material. This 
threshold reduction is achieved with very little change in 
the intermediate field threshold—i.e., the voltages at which 
parasitic MOS devices are created under silicon lines over 
thick oxide. 


A. STATIC AND DYNAMIC MOS CIRCUITS 


The unique characteristics of the MOS FET permit the 
construction of a wide variety of logic circuits with the 
devices. The MOS device may be used as an active 
amplifying device or as a load resistor. In Fig. A-2, the 
characteristics of a device are shown, together with a curve 
corresponding to the device used as a load resistor with a 
15 V supply. These curves show drain current (Ip) vs 
drain-to-source voltage (Vpg) with gate-to-source voltage 
(Vag) as a parameter. The substrate is assumed at source 
potential. The load resistor curve is only approximate, as 
substrate bias effects have been neglected. In the discus- 
sions which follow, high and low refer to the relative 
magnitude of the voltage with respect to the substrate level. 
Polarities are to be assumed correct for p-channel devices, 
i.e., all voltages negative with respect to the substrate. 


Figure A-2. Typical MOS FET Characteristics (Dotted line 
shows load line for device used as pull up resistor) 


In Fig. A-3, four types of MOS logic inverter stages are 
shown. In Fig. A-3a, two MOS devices, Q; and Qo are 
wired as a static inverter. When input 1 is sufficiently high, 
Qo turns on and output 0 is low. If 1 is low, Qo is off and 
Q; pulls the output high. This circuit requires that Q, have © 
radically different geometries than Qo. For equivalent bias, 
Q»5 must have much higher conductance than Q; to get 
reasonable noise margins. This fact is symbolized in 
Fig. A-3a (and in A-3b) by showing Q, symbolized as a 
MOS resistor rather than as a device. As a result of the low 
conductance of Q,, current available from Q, for charging 
load capacitance is quite limited. As a consequence, low to 
high transistions are rather slow. 


Fig. A-3b is a variation on Fig. A-3a. When the clock ¢ is. 
active, the circuit is much like that of Fig. A-3a. By making 
the clock voltage ¢ higher than V, a more consistent high 
output level may be established. 


Once the output level is established, the clock @ may be 
switched off to save power. This circuit technique may also 
be used to give improved noise margins. 
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d. Dynamic Non-Dissipating 


Figure A-3. MOS Inverter Stages 
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In Fig. A-3b, the output behaves as an inverter when the 
clock @ is active. In Fig. A-3c, Q, has such a high 
inductance that when the clock is active (high), low outputs 
may have noise margins so poor that they are unusable. 
However, after the clock is removed, low outputs quickly 
approach usuable values if the input “1” is maintained long 
enough. During the time that ¢ is active, this circuit may 
consume large amounts of power (if input 1 is high). This 
circuit may be used to drive relatively large capacitive loads 
because Q, may have relatively high conductance and 1 
may have higher amplitude than V. 


In the circuit of Fig. A-3d, the capacitive load is charged 
when @ is high, and then is discharged when ¢ goes low, if 
and only if 1 is high. This circuit draws current only to 
charge and discharge the load capacitance; there is no DC 
power drain. However, the load capacitance is reflected 
back to the clock driver. 


The circuits of Fig. A-3b, A-3c and A-3d make use of 
temporary retention of data on the load capacitance, and 
are said to be dynamic, while the circuit of Fig. A-3a is DC 
stable and therefore is said to be static. 


Larger capacitive loads may be driven by static inverters if a 
booster stage is used. However, a loss of “high” level is 
introduced unless bootstrap techniques are utilized. A 
booster stage is shown in Fig. A-4. 


Figure A-4, MOS Inverter with Output Booster 


In Fig. A-5, a transmission gate is shown. The load 
capacitance Cy, may be charged to a voltage equal to that 
on Cg when the clock ¢ is high. When @ is returned low, Cy, 
retains the value to which it was charged. 


The circuit of Fig. A-5 may be driven from a number of 
different source circuits. However, when the source circuit 
is dynamic, the transmission gate clock, ¢ should be kept 
high during the entire period when Cg and Cy, are being 
charged and discharged or improper voltage levels may 
result. For example, if Cg is initially charged and Cy, 
initially discharged, when the transmission gate clock is 
made active, Cg and Cy, share their charge, resulting in a 
voltage level which is a function of the ratio of Cy, and Cg. 
Unless the circuit is designed to operate with such 
intermediate voltage levels, incorrect operation will result. 
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Figure A-5. Transmission Gate 


These circuits give some idea of the flexibility of design of 
logic with MOS FET’s. In addition to these techniques MOS 
devices can easily realize compound gate structures. An 
example is shown in Fig. A-6. On Fig. A-6, only eight 
devices are used to realize the function 


f = (ab + cd + axd + cbx + yz) 


This type of compound gate structure can be used with any 
of the forms of Fig. A-3. 


Figure A-6. Compound Gate. Example shown realizes 
f = (ab + cd + axd + cxb + yz) 


B. STATIC AND DYNAMIC MOS MEMORY 
CELLS 


Fig. A-7 illustrates two basic types of MOS memory cells. 
Fig. A-7a, shows a static memory cell consisting of two 
static inverters wired as a flip-flop. A pair of transmission 
gates selectively connects the cell to a pair of vertical data 
wires. Such cells may be arranged in a two dimensional 
array. When one select wire is made high, the corresponding 
row of cells is effectively connected to the data wires. 
Writing is accomplished by forcing signals on the data wires 
which are sufficient to “flip” the flip-flops. To prevent 
flipping the cell when reading, both data wires are initially 
charged high prior to activating the select line. 


a. STATIC 


b. DYNAMIC 


Figure A-7. Memory Cells 


Operation of the dynamic cell of Fig. A-7b is quite 
different from that of the static memory cell. Data is stored 
as charge on the parasitic capacitance Cg associated with 
the gate of Qo and the connected junction of Q,. Data may 
be written into this capacitance via the transmission gate 
formed by transistor Q,. The data to be written is placed 
on the WDATA line and WSEL is activated (made high). To 
read from the cell, the RDATA line (with its associated 
capacitance Cp) is initially charged high by activating the 
signal ¢. When the line RSEL is activated, the RDATA line 
will be discharged if and only if the capacitor Cg contains a 
high, and will remain high if and only if Cg contains a low. 
We may, therefore, consider that the RDATA line then 
contains the logical complement of the cell data. 


Although the read out operation from the cell is non- 


destructive, the leakage associated with the junction of Q; 


eventually may result in the loss of the charge stored in Cg. ab oa 
To maintain the data stored in the cell, the data must be EN 
periodically regenerated. This regeneration is accomplished - 


by reading the contents of the cell, out onto the read data 
line, inverting and amplifying the signal and applying it to 


the WDATA line, and rewriting back into the cell DY So 


activating the WSEL line. A circuit which performs the 


inversion and amplification function is called a refresh _. 


amplifier. 


In use, the dynamic cells are laid out in a two dimensional 


array. One entire row of cells is refreshed (or accessed) at — 


one time, one refresh amplifier being provided for each 
column of cells in the array. To refresh the entire memory, 
each row of cells must be individually refreshed. 
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APPENDIX B-SCHOTTKY BIPOLAR TECHNOLOGY 


Bipolar technologies are familiar to the user of logic in the 
form of RTL, TTL and ECL families. Most early bipolar 
technologies utilized doping with gold to reduce minority 
carrier lifetime. Without gold doping, the storage time of 
transistors in saturation would slow logic circuit operation. 


While gold-doping made fast circuits, the side effects 
limited circuit capabilities. Because of the reduced minority 
carrier lifetime, such wide base devices as lateral and 
substrate pnp transistors were not usable in gold doped 
technologies. The effects of the doping were diminshed at 
high temperatures, so that speed of performance would 
have to be derated for high temperature operation. 


However, in the late 1960’s, it was discovered that an 
aluminum to n-silicon contact formed a Schottky barrier 
diode. While early diodes were somewhat erratic in 
performance, these diodes exhibited a lower forward 
voltage drop than conventional p-n junction diodes. It was 
recognized that such a diode could be used in the circuit 
known as the Baker clamp to prevent transistor saturation. 
Using this circuit would then eliminate the need for gold 
doping. Figure B-1 shows the Baker clamp circuit. 


In Figure B-1, when i, is sufficient to drive transistor Q 
into saturation, the collector voltage falls to a level which 
forward biases the Schottky diode D. Further increases in 


SCHOTTKY 
DIODE 


Figure B-1. Schottky Clamped Transistor 


ip flow through diode D rather than into the base of 
transistor Q. Because of the diode, transistor Q is not driven 
hard into saturation. Thus, upon turn off of the current ip, 
transistor Q exhibits negligible storage time. 


The first to develop a Schottky process using this diode was 
Intel Corporation, with the first such product being the 
3101 64-bit memory. The Schottky diodes are formed by 
contact of the aluminum metallization to the collector 
region. To make ohmic contact to the collector, the 
conventional technique of first diffusing n+ region (emitter 
diffusion) into the n— collector region is used. An 
aluminum contact to the n+ region is ohmic while contacts 
to the n— region become Schottky diodes. This process 
requires no additonal masking steps to produce the 
Schottky diodes. All of Intel’s bipolar devices utilize the 
Schottky technology. 


Because this technology does not destroy the gain of 
substrate and lateral pnp transistors, these components 
become available to the circuit designer. Most of Intel’s 
bipolar products use substrate pnp transistors on all inputs, 
as shown in Fig. B-2. As a result, the inputs offer far less 
DC loading than standard TTL logic. This feature is most 
helpful when interfacing to MOS logic. - 


EMITTER (Equivalent to npn base) 
BASE (Equivalent to npn collector} 
COLLECTOR (Substrate) 


Figure B-2. Intel Substrate - pnp Bipolar Input Circuit 
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Solid state 


Schottky diodes make IC scene 


With reproducibility problems licked, these devices make attractive elements; 


they permit unusual component combinations, save chip real estate, 
reduce power dissipation, and enhance production yields 


By.R.N. Noyce, et. al. 
Intel Corp., Santa Clara, California 


Schottky diodes, because they don’t store charge, 
open new worlds to digital integrated circuits. Di- 
ode-transistor logic with Schottky diodes can be 
_ made as fast as transistor-transistor logic and emit- 
ter-coupled logic. What’s more these devices reduce 
an IC’s power dissipation and permit unusual com- 
binations of components on a chip. They also make 
large-scale integration easier to achieve because 
more functions can be performed in less area. Not 


_ the least of their virtues is the fact that they can 


increase production yields. 

For all these advantages, Schottky-diode IC's 
have not been commercially available. Intel Corp., 
however, has concentrated on their development 
and is now making Schottky diodes that are stable 
and reproducible. The company has committed 
Schottky IC’s to commercial production, and has 
found that they live up to expectations. Their 
first Schottky IC will be introduced in the next 
few weeks. 

Structurally, the Schottky diode is little more than 
a metal in contact with a semiconductor. As long as 
aluminum is used for the metal, formation of 
Schottky diodes in a monolithic IC is compatible 
with standard processing. Some provision is needed 
to prevent high-field effects at the edge of the metal 
contact, but this too is compatible with standard 
processing. In fact, once reproducibility problems 
have been solved, Schottky diodes actually simplify 
processing of high-speed digital IC’s because gold 
doping is eliminated. 

The Schottky diode has been around in one form 
or another for a long time—the crystal detector in 
early radios, for instance. The modern device, 
which gets its name from the German scientist who 
developed the first valid theory of metal-semicon- 
ductor rectification, can be used to enhance the 
performance of existing IC designs as well as to 
develop entirely new configurations with excep- 
tional features. 
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In the forward-biased p-n junction, current is 
carried by holes flowing from the p-type material 
into the n-type material, and electrons flowing 
from the n to p material. Either case results in 
an excess of minority carriers near the junction. 
If the voltage is reversed these carriers will flow 
back across the junction, creating a high current 
until the supply is exhausted. In other words, the 
p-n junction can’t be turned off immediately. 

In a Schottky diode made of aluminum on n-type 
silicon, essentially all of the forward current is 
carried by electrons flowing from the semicon- 
ductor into the metal. They quickly come into 
equilibrium with the other electrons in the metal, 
so there is effectively no stored charge to prevent 
rapid switching. Another major point of difference 
between the Schottky and the p-n junction diode 
is that the former has lower forward voltage for 
a given current. 

In a practical circuit the Schottky diode is placed 
in parallel with the base-collector junction of an 
npn transistor; the metal electrode is connected to 
the base and to the n region of the collector, where 
it forms a rectifying contact. 

Since the Schottky diode has a lower forward 
voltage compared with the collector-base junction, 
the diode clamps the transistor and diverts most 
of the excess base current through the Schottky 
diode, preventing the transistor from saturating. 
There’s no charge storage, either in the transistor 
or diode. 

Clamping techniques have been used in the past 
to prevent charge storage. Most are variations of 
the Baker clamp proposed in 1956.1 In this scheme, 
a germanium diode shunts the collector-base junc- 
tion of a silicon transistor to prevent it from being 
forward-biased; some charge is still stored in the 
germanium diode. The Schottky IC employs clamp- 
ing but charge storage is eliminated. 

In the past, other efforts to improve switching 


speed by reducing or avoiding stored charge have 
taken two approaches: process innovation and novel 
circuit design.” 3 4 

Attempts to minimize storage time through novel 
circuit designs have had varying success. The most 
familiar is current-mode switching, which avoids 
saturation (and hence charge storage) by control- 
ling the collector current so that the collector-base 
junction never goes into forward conduction. Cur- 
rent-mode logic is fast—propagation time through 
a gate can be as low as 1 nanosecond—but when 
such IC’s are assembled in a system, they will 
oscillate unless precautions are taken to control 
their input impedance characteristic. 


Off the gold standard 


The standard processing approach to charge- 
storage reduction in conventional IC’s is gold dop- 
ing. Since gold acts as a recombination center, it 
reduces the lifetime of minority carriers; that is, 
it shortens the time for recombination of holes and 


metal is also the anode of the diode; the collector 
avoided with the extended-metal or guard-ring struc- 


and Schottky transistors. The former uses up 7 nsec 
of recombination time before it can change state; 


electrons. Intentional reduction of minority car- 
rier lifetime is somewhat ironic. In the early days 
of transistors, much effort was expended in increas- 
ing the minority carrier lifetime to increase current 
gain and decrease junction leakage. 
Unfortunately, the gold diffusion has some highly 


undesirable side effects. It tends to reduce gain 


(hz). If this value gets too low, the IC’s won't 
function because the transistors either don’t satu- 
rate or have excessive turn-on delay. The manufac- 
turer can use a narrower base to get higher hgr, 
but this can lead to excessive junction leakage. 
The Schottky process sidesteps this problem. 
Without the need for gold doping, low storage 
time and high hrg can be achieved simultaneously. 
Another difficulty with gold doping is the lack of 
selectivity—if one transistor must be high speed, 
all the others on the chip have to be high speed too. 
But with the Schottky process, fast and slow de- 


vices can be mixed on the same chip. This freedom 


of choice in component characteristics is exploited 


Double duty. In a Schottky transistor a Schottky diode 
is in parallel with the collector-base junction. The base 


diffusion is also the diode cathode. Edge effects can be - 


ture. The curves compare storage time of the gold-doped 


the Schottky transistor responds almost instantaneously. 


‘fe 


DEPLETION 
REGION 


HIGH ELECTRIC 
FIELD METAL 


CAMS: 


REGULAR DIODE 


DEPLETION 


METAL EXT OVER 
ETAL EXTENDS REGION 


OXIDE LAYER 


Si0e 


EXTEN DED-METAL 


DEPLETION 
REGION 


DIFFUSED 


GUARD-RING 
Edge effect. The high field concentration at the edge of 
the regular Schottky structure can cause spurious 
currents. These can be avoided by means of the 
- extended-metal structure or the guard ring. 


in some of Intel’s new products. 

But before Schottky diodes could be successfully 
integrated manufacturers had to resolve some 
dilemmas; diode characteristics were often grossly 
different from theoretical predictions and devices 
were not reproducible. The cause, it has since been 

found, is the high electric-field concentration at 
the periphery of the metal electrode, as shown 
above. This field, through the mechanism of tun- 
neling, can lead to spurious currents that dominate 
the ideal diode characteristic. Because the shape 
of the edge varies from one device to another, the 
magnitude of these excess currents can vary greatly. 

Two solutions to this problem were recently pro- 
posed.®: ® 7 In one, the metal electrode extends over 

the silicon-dioxide passivating layer. With this 

dielectric layer between the metal and the semicon- 
ductor, the electric field at the periphery of the 
electrode is greatly reduced, and the diode has 
ideal characteristics. 

In the other solution, the edge of the metal ex- 
tends over a diffused p+ guard ring. This also 
reduces the field, and moves it away from the 
edge of the electrode so that it can’t affect the 
‘diode. Again, the characteristics are ideal. 

Electrically, the aluminum-silicon Schottky diode 
differs somewhat from the diffused p-n junction 
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diode in its d-c forward current-voltage character- 
istic, shown at right. The forward voltage of 
the Schottky diode at any given current is 200 to 
300 millivolts less than that of the junction diode, 
making it an ideal low-voltage clamp. 

Another major difference is the recovery charac- 
teristics, opposite page at bottom. The Schottky di- 
ode storage time is effectively zero, in contrast to 
typical values of 6 nsec for the gold-doped junction 
diode and 30 nsec for the junction diode without 
gold doping. In all three cases, the voltage across 
the diode decays with the same R-C time constant 
once the stored charge recombines. 

The Schottky transistor is simply an npn IC tran- 
sistor, produced by conventional photolithography, 
diffusion, and epitaxial growth techniques with an 
Al-Si Schottky diode in parallel with the collector- 
base junction. The base contact of the transistor 
serves as the metal contact of the diode and the 
collector region of the transistor serves as the n 
region of the diode. It’s convenient to regard the 
npn-Schottky diode combination as a single device 
and to represent it by a single symbol: 


To form the Schottky transistor, the base-contact 
opening is extended beyond the base diffusion and 
over the collector region. When the aluminum 
metalization is deposited, it simultaneously func- 
tions as the contact to the base region and the 
anode of the Schottky diode. To prevent high field 
concentration, the metal can either be extended 
over the passivating oxide or terminated over a p+ 
guard ring, which can be diffused into the chip at 
the same time as the base, as shown on page 75. 

One big difference between the Schottky tran- 
sistor and the conventional gold-doped npn device 
is in offset voltage, which is lower for the latter— 
typically 120 millivolts vs. 250 mv. However, the 
saturation voltage need not be excessive because 
of this large offset; by controlling both device 
geometry and processing, Vox(sat) can be kept to 
0.5 volt at 20 ma collector current and 1 ma base 
current. And because the Schottky diode has a 
lower temperature coefficient than the emitter- 
base diode, Vor(sat) decreases with increasing tem- 
perature. 

This behavior of the Schottky diode amounts to 
partial temperature compensation of the emitter- 
base diode and hence of Vorjsat). It helps to 
maintain a high worst-case logic 0 noise margin 
at high temperature when the Schottky transistor 
drives fan-out circuits that operate at a 2Vpz logic 
threshold as standard DTL and TTL IC’s do. 

The storage time of a Schottky transistor is 
effectively zero, as shown on page 78, in contrast 
to typically 7 nsec for a gold-doped device and 
34 nsec for a non-gold-doped unit. Moreover, stor- 
age time hardly varies with temperature. At 125°C, 
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Forward characteristic. At 1 ma forward current, voltage 
drop is 0.45 volts for the Schottky diode, 0.74 volts for 
the p-n junction diode. 
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No storage. Transient characteristic shows that the 
Schottky diode, unlike p-n junction diodes, has essentially 
zero storage time. 


it’s still less than a nanosecond, whereas the gold- 
doped storage time doubles to 15 nsec at that 
temperature. 

Aside from much faster switching speed, Schottky 
diodes offer the IC designer a greater range of de- 
vices to choose from. Because gold doping can’t 
be done selectively, the designer has at his dis- 
posal only one active component: a fast switching 
npn transistor whose emitter-base and collector- 
base junctions can be used as diodes. But since the 
Schottky process doesn’t require gold doping, the 


designer has seven active components to choose 
from. And he can mix them at will—fast- and slow- 
switching devices can be placed on the same chip. 
In both the Schottky and the gold-doped transistor, 


- various levels of resistivity are available in the bulk 


material, as well as in the isolation, emitter and 
base diffusions for use as resistors. 


Advantageous options 


The Schottky diode across the base-collector - 
junction of the npn transistor is an almost ideal 
active switch with less than 1 nsec storage time and 
high amplification. Without the Schottky diode, 
the device becomes a charge-storage npn transistor 
characterized by low saturation voltage, high in- 
verse gain (hrg), and lengthy recovery time. — 

Substrate pnp transistors can be diffused into 
the chip too; these can have hry of 10 or more and 
are ideal for input buffering. Lateral pnp transistors 
can be formed as well—with and without Schottky- 
clamped collectors to provide fast or slow recovery | 
time. These devices are useful as curent sources 
and for voltage translation. 

Then there are SCR’s. Like transistors, hese can 
be diffused into the chip in both charge- -storage 
and Schottky-clamped versions, each of which can 
be optimized for different purposes. They make 
efficient bistable elements and give a high func- 
tional density to integrated shift registers and 
counters. 

And, of course, there are diodes—emitter-base, 
charge- -storage, and Schottky—each with different 
characteristics. 

With just the Schottky diode and transistor, it’s _ 
easy to upgrade old IC designs. Take DTL, for 
instance. If, without changing any of the resistor 
values, the basic DTL gate is modified with Schottky. 
components as shown at top of page 80, the per- 
formance of the gate is improved as follows: 

= Less sensitivity to power supply variations, 
since the circuit will continue to function at a lower 
minimum supply voltage. This is because; when 
all inputs are in the high state, the voltage at the 
base of Q, is typically 0.3 volts lower that it would 
be in the gold-doped counterpart. 

" Less sensitivity to temperature variations be- 
cause of the compensating effect of the Schottky 
diode temperature behavior. 

= Faster switching; the gate turns on faster, and 
storage time is less than 1 nsec, remaining so. 
throughout the operating temperature range. 

In addition, modified DTL circuits can be fab- 
ricated at better yield (and lower cost to the user) 
because hyg is higher and leakage is lower than in 
the gold-doped version. 

The Schotiky version of the TTL gate also has 
some important advantages over its gold-doped 
counterpart: 

* Lower input inverse leakage current, typically 
less than 0.1 microampere. This is made up of the 
reverse-bias leakage of the emitter-base junction 
and the input clamp diode; there is no contribution 
from inverse hyg since the collector base junction 
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of the input transistor never becomes forward 
biased. 

" Less line reflection, since the input clamp di- 
odes—now Schottky diodes—have a lower forward- 
bias voltage. 

= Lower power dissipation because no charge is 
stored (and wasted) in the transistors. 

» Higher output high level, by approximately 
0.3 volt. 

Although the Schottky process significantly im- 
proves many of the existing circuits, its real poten- 
tial lies in entirely new designs. For example, by 
designing from scratch with the Schottky process 
in mind, a DTL gate can be made just as fast as 
TTL. An example of such a circuit is shown 
on page 80, bottom left. 

_ When the circuit is loaded with two-200-ohm 
resistors, as shown, the switching speed is about 
5 nsec, comparable to that of the fastest TTL cir- 
cuits commercially available. The advantage of DTL 
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Storage time. The Schottky transistor, because it doesn’t 
store charge, responds much more rapidly to an input 
signal (top traces, ambient temperature 25°C). This 
difference becomes even more pronounced at high 
temperatures (bottom curves). 


over TTL, given equal speeds, is that the outputs 
of the DTL gates can be OR-tied. Moreover, DTL 
doesn’t create heavy current transients in the power 
supply during switching. 

The input current to this Schottky DTL gate is 
low, typically 250 microamperes. This presents a 
very small load to the driving gate. As a result, 
each gate can drive a large number of similar gates 
—fan-out can be 25 or more. If the two 200-ohm 
resistors are used as a terminating resistance for a 
100-ohm line, there will still be enough current left 
to drive 10 gates. 

Another novel circuit is the binary divider on page 
80, bottom right. It combines the best of the slow 
and fast worlds: the Schottky transistors that com- 
prise the flip-flop switch at high speed, and the 
slow charge-storage transistors serve as tempo- 
rary-data-storage elements. 


Bridge work 


The special advantages of these circuits, and of 
the Schottky process that makes them possible, are 
particularly applicable to complex circuits. One 
such IC built at Intel is a 64-bit high-speed scratch- 
pad memory. It’s organized as 16 words by 4 bits; 
each of the 16 words is addressable through its own 
binary decoder. It has four data inputs, four address 
inputs, and four outputs. If it’s required, the outputs 
can be OR-tied with the outputs of other memory 
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STANDARD 


2002 


integrated circuit chips. 

The input load is equal to one unit TTL load, and 
the output can sink 20 milliamperes. The access 
delay—from address input to data output—is less 
than 50 nsec under a 20 ma resistive load. Power 
dissipation, including address buffering, decoding, 
sensing, and control logic, is typically 6 milliwatts 
per bit. 

These specifications represent major improve- 
ments over gold-doped IC performance. Because 
storage time has been eliminated by the Schottky 
process, the input-to-output propagation delay has 
been reduced by as much as 40% for certain criti- 
cal delay paths. And the variation in propagation 
delay due to temperature change is considerably 
less than in a gold-doped circuit. The higher hys 
values that result from the Schottky process also 
help keep propagation delay constant with tem- 
perature. This -is particularly important for cir- 
cuits that operate at low temperatures where large 
increase in delay, due to low hrs, is a chronic 
problem in gold-doped IC’s. 
- Another advantage of this higher-valued hy, dis- 
tribution is that transistors can be designed for 
higher forced-beta conditions, and power dissipa- 
tion is therefore lower. For the bipolar logic-buffer 
‘ chip, for example, power dissipation is reduced 
by about 30%. 

Most significant, perhaps, the Schottky process 
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Updated. Modifying 

the standard DTL gate 
with Schottky diodes 

and transistors (top) 
affords significant 
performance improvement. 


The new DTL. Without 
gold doping, many 
components become 
available to the IC 
designer. In the DTL 
gate at bottom left, input 
loading is reduced an order 
of magnitude by the 
substrate pnp’s that 
replace the input diodes 
normally used. 


MODIFIED 


Simplified. With the 
Schottky process, a binary 
counter stage (bottom 
right) is simple and 
occupies a small chip area. 
This is because charge- 
storage transistors (Q: and 
Q:, which are incompatible 
with the gold-doped 
process, can be used to 
store information. 


has resulted in higher functional density and 
smaller chips. A comparison of the Intel 64-bit 
memory with an equivalent gold-doped circuit is 
revealing: Although the gold-doped IC is made with 
the same mask tolerances, the Schottky version 
uses a 30% smaller chip. 

With all that they have going for them—high 
speed, greater functional density, and flexibility 
of design—Schottky-diode IC’s certainly appear to 
have a bright future. 
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A 4096-BIT DYNAMIC MOS RAM 


SESSION |: Memory | 


WAM 1.1: A 4096-Bit Dynamic MOS RAM 


By J.A. Karp*, W.M. Regitz and S. Chou 
Intel Corp., Santa Clara, California 
*Now with Intersil Corp, 


AN MOS SEMICONDUCTOR MEMORY ARRAY, designed for 
large main memory, will be described. High density has been 
achieved with a unique cell configuration, in combination with 
N-channel silicon-gate technology, while still maintaining manu- 
facturing tolerances consistent with high volume and low cost 
production. 

A block diagram of the memory array is shown in Figure 1. 
The 12-bit address buffer register accepts TTL inputs; 2.4-V 
minimum. The address is then decoded into 1 of 64 rows and 1 
of 64 columns, thus selecting one of the 4096 cells contained in 
a 64 x 64 matrix. The decoders use 6 input dynamic NOR gates 
which accomplish high speed address decoding at minimum 
power dissipation. The address buffers, which also utilize 
dynamic circuitry, not only convert the TTL levels to MOS 
levels, but also serve as a register, thus requiring stable address 
for only 100 ns; Figure 2. 

The 4096-bit memory uses a single high-level clock (CE) from 
which all internal timing signals are. triggered. The precharge 
clock input, normally used in dynamic RAMs to precondition all 
internal nodes, has been eliminated. The function of this clock is 
performed by the CE driver; thus all dynamic nodes are 
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4096 bits: 4096 words by 1 bit 

TTL compatible inputs; except clock 

Single clock input; high level 

Cell size < 2 mil2; 3 transistors 

Access time < 300 ns / 
Cycle time < 500 ns 
Active power < 100 uW/bit 

Power supplies: +12 V, +5 V,-5 V 

Standby power < 1 uW/bit 

Standard 22-pin package 


Memory array characteristics. 


preconditioned automatically between active memory cycles. 
The additional internal timing pulses in Figure 2(R, R and Row 
Voltage) are representative of the types of signals that have been 
generated on the chip. 

The cell has three minimum geometry transistors and 
occupies an area of 1.8 sq. mils. This small area per bit is the 
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FIGURE 1—Block diagram of 4096-bit dynamic RAM. 
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result of the cell configuration (Figure 3) which needs only 2-1/2 
interconnect lines per cell as follows: 


A)-A single selection line 1 activates a row of cells for both 
reading and writing. The Row Voltage is a tri-state signal 
gated on to the Row Select line by the Row Decoder. 

Reading of the cell is accomplished during the inter- 0 l0O0 200 300 400 500 
mediate level of this signal and writing or refreshing the 
cell is accomplished during the high level of this signal. 


The high level is terminated as soon as proper informa- ADD. NEE QQ 
[eae ea ae 


tion is gated to the cell in order to conserve power. 


B)—A single column line 2 allows transfer of information into cE 
or out of the cell. This line is connected to the column 
amplifier which is used to sense the information in the a 
cell, or write new information into the cell. CE 
C)—A ground line is shared between two adjacent columns of R 
cells, resulting in 1/2 connection per cell. 
Since the information is stored dynamically in the cell, it R 
must be refreshed periodically. To refresh the 4096-bit array, it 
is only necessary to perform memory cycles at each of the 64 ROW 
row addresses. VOLT. 
The memory can perform read, write or read-modify-write 
cycles controlled by a single TTL level input signal. DATA 
OUT 


1 Regitz, W. M., Karp, J. A., “Three-Transistor Cell 1024-Bit 
- §00-ns MOS RAM”, IEEE Journal of Solid-State Circuits, 
Vol. SC-5, No. 5; Oct., 1970. 


2“Pield-Effect Transistor Capacitor Storage Cell,’ United 
States Patent 3,585,613. FIGURE 2—Timing diagram showing inputs (ADD, CE) and 
internally-generated signals (CE, R, R, ROW VOLTAGE). 
Dashed lined shows worst case as predicted by computer 
analysis. 
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FIGURE 3—Microphotograph and schematic of 3-transistor cell. 
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N-channel goes 
to work with TTL 


Two dynamic shift registers and 
a static RAM operate from a single 
5-V supply, have 1-V thresholds 


by R. Abbott, H. Gopen, T. Rowe and D. Bryson, 
Intel Corp., Santa Clara, California 


C} Nearly ten years of increasing success with p-channel 
devices have brought MOS technology to the point 
where it can tackle the much harder job of n-channel 
processing. With the addition of silicon gates, such de- 
vices can outdo their p-channel forerunners in speed, 
packing density, and low threshold voltage. They also 
offer direct compatibility with transistor-transistor logic. 

For instance, a family of dynamic recirculating shift 
registers now being built with a silicon-gate n-channel 
process has speeds typically of 2 megahertz and thres- 
hold voltages of less than a volt, and is TTL-compatible; 
it enables an entire memory subsystem to operate from 
a single +5-vV power supply. The same process is being 
used to build a static 1,024-bit random-access memory, 
which also operates off a single + 5-v supply. 

A different n-channel process is being developed that 
will yield a 4,096-bit dynamic RAM. Like most MOS de- 
vices, this will require both positive and negative supply 
voltages. 

An alternate approach to designing n-channel devices 
is to forego compatibility with TTL and instead go all 
out for speed. This requires the device to be operated 
like its p-channel equivalent, at drive voltages of 10 to 
17 v. But for this initial family of circuits, TTL compati- 
bility was preferred, in order to promote its use in exist- 
ing systems. 


How to make n-channel devices 


The processing steps used in building the shift regis- 
ters and the static RAM are much the same as those for 
p-channel silicon-gate structures. The difference is that 
with n-channel devices the starting wafer is boron- 
doped, p-type silicon. The device threshold is then 
maintained at less than 1 V by controlling the impurity 
of the boron and the thickness of the gate dielectric. 

Fabrication starts with the p-type wafers being 
thermally oxidized to a thickness of about 1 micro- 
meter, and parts of the oxide being removed to define 
the diffusion and gate regions in the first of five photo- 
masking steps. Then a gate dielectric and a polycrystal- 
line silicon are deposited, and areas defining the gate re- 
gions of the transistors and the deposited silicon under- 
crossing are etched out, again by photomasking. 

Next comes a diffusion operation, in which n-type im- 
purities form the source and drain regions and the dif- 
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1. Making n-channels. Fabricating n-channels requires the deposi- _ 
tion of a gate dielectric and polycrystalline silicon (a), after which an © 
etching step defines the gate regions and the silicon-undercrossing 
(b). Next, contact holes are etched through the glass to the polysili- 
con and diffused regions (c). The finished device has the usual 
aluminum interconnection.and glass passivation (qd). 


fused undercrossing (see Figs. la and b). Glass formed . 
by the oxidation of silane is then deposited on the wa- 
fers at low temperature, and contact holes are etched 
through it to the polycrystalline silicon and the diffused 
regions (see Fig. Ic). 

Aluminum is evaporated onto this surface, and a 
fourth photomasking step forms interconnection pat- 
terns. Finally, the wafers are heated (alloyed) to insure 
ohmic contact between the aluminum and silicon, and a 
glass passivation layer is vapor-deposited over the sur- 
face. The fifth photomasking step removes the glass 
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from the bonding pad area and the scribe line. 

A cross section of the completed device is shown in 
Fig. 1d. For some more complex n-channel arrays, a 
buried contact may be desirable to save silicon space. 
The price is an additional photomasking step, used to 
etch contact windows before the polycrystalline silicon 
is deposited, so that the polysilicon may make direct 
contact with the diffused regions. 


Low voltage payoff: greater density 


There’s more to this low-voltage n-channel silicon- 
gate process than just the convenience of its compatibil- 
ity with TTL. The size of a given chip layout is a func- 
tion of the voltages applied to the various junctions, and 
since the low-voltage device requires only a +5-volt 
supply, and since the internal nodes are at even lower 
voltages, diffusions may be placed close together (0.4 
mils) and channels may be short (0.25 mils). Indeed, 
tighter and smaller layouts are possible without the ex- 
pense of tight alignment tolerances. 

The resulting high packing density permits larger ar- 
rays at a lower cost per bit. Table | lists the area savings 
it makes possible, and Fig. 2 shows the photomicro- 
graphs of the cells for the devices listed in the table. For 
a RAM, the n-channel array needs a memory cell less 
than half the size of a comparable p-channel device cell, 
enabling a chip that could handle only 256 bits as a p- 
channel RAM to accommodate 1,024 bits as an 
n-channel RAM. A similar density advantage applies to 
shift registers. 


p-CHANNEL 


n-CHANNEL 


n-CHANNEL 
2401 


2. Making room. Photomicrographs of RAMs and shift registers 
made with p- and n-channel technology show higher device density 
achieved by the n-channel devices. Compared here are static RAMs 
and a dynamic shift register, all units that are in production. 


6-18 


Table 1: 


Table 2: 
Comparison of p- and n- channe! products 


1,024 x 1 


17.2 mil? 
1404A 
Dynamic 


shift 
register 


Products 


n-channel 
recirculating shift 
register 


2,048 bits 
1,024 x 2 
125 x 151 mils? 


5.6 mils? 


+5 V/ground 


2.2 V/0.65 V (TTL) 


2.2 V/0.65 V (TTL) 


p-channel 
dynamic shift 
register 
1,024 bits 
1,024 x 1 
1418 x 136 mils? 


10.0 mils? 


Parameters 


Organization 
Chip size 


Cell size 


Power supply 


or supplier +5/-5/-12 V 


Data levels Veco —2/ Veco — 4.2 


Clock levels +5/-12V 


Number of clocks 


Clock capacitance | 140 F 
Maximum power 500 mW 350 mW 

dissipation at 25°C, (not including (including clock 
maximum frequency | clock generator) generator 


Maximum frequency 
(over temperature 
range) 


25 kHz @ T, = 70°C 


No external Ry 
needed 


Recirculating and 
chip select 


Minimum frequency 10 kHz @ Ty = 70°C 


Output requirement External R, needed 


Other features 


Another advantage of the low-voltage technology is 
the reduction of parasitic device interactions, which are 
caused by interconnection levels that have higher 
threshold voltages than the voltages used for the oper- 
ation of the circuit. This eliminates both large leakage 
paths and high capacitance caused by field inversion— 
problems that needed solving before n-channel devices 
could be implemented. 

The reliability of n-channel silicon-gate technology is 
dependent on the stability of the threshold voltages of 
the MOS transistors and the two levels of interconnects 
used in the array. In addition, the stability of dynamic 
storage circuits will be determined by the change in 
junction leakage, since it is this leakage that determines 
the retention time of the memory cell. 

The basis for the inherent high reliability of silicon- 
gate n-channel devices is the way the layers are ar- 
ranged. In the cross section of an n-channel MOS transis- 
tor and the interconnects used in the silicon-gate pro- 
cess shown in Fig. 3, three parts of the device are 
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3. Reliable. With the low-threshold silicon-gate process reliability is maintained in all parts of the n-channel device. In region labeled (a), the 
glass passivation insures against contamination, just as in p-channel fabrication. The silicon interconnects and the metal lines—regions (b) 


and (c)—are also well protected because of the thick oxide step surrounding the silicon gate and aluminum electrode. 
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4. Extra protection. Devices made with n-channel technology have better gate protection than p-channe! structures made with conven- 
tional fabricating techniques. The low n-channel breakdown voltage curve means that these n-channel structures operate at voltages sub- 
stantially lower than those causing gate breakdown. This extra gate protection, compared to p-channel! devices, adds.to device reliability. . 


identified; the thin field segment (a), the intermediate 
field with the silicon line interconnects (b), and the thick 
field segment with metal line interconnects (c). 

In (a) the MOS transistor is well protected from the ex- 
ternal environment by a glass passivation barrier im- 
pervious to contamination. The resistance of this barrier 
to contamination (Na+) has been well established in 
p-channel technology. The interconnects of (b) and (c) 
are also fabricated as in the p-channel technology pro- 
cess, and so are also well protected from external con- 
tamination. 

Included in Fig. 3 are the results obtained after 
800,000 unit-hours of stress on test devices. It shows that 
virtually no shifts in any of the threshold voltages were 
observed. The stress condition on these units was 5.5 
volts (maximum value for the TTL circuits) at 125°C. 
The junction leakages of these devices were also moni- 
tored, and no increases observed. 

An additional reliability feature of the n-channel 
technology is improved gate protection of the input de- 


vices—resulting in the lower breakdown voltage of the 
diode. Figure 4 compares the dc behavior of the same 
gate protection device for both n-channel and p-chan- 
nel processes as a function of the static charge applied. 
If the same dynamic impedance is assumed for the two 
diodes, the breakdown voltage in the n-channel version 
is consistently lower than in the p-channel device, re- 
quiring a large static charge to break down the gate di- 
electric. 


The real thing 


The n-channel 2,048- and 1,024-bit dynamic recircu- 
lating shift registers, the Intel 2401 and 2405, are exam- 
ples of products using this new technology. Both are 
directly TTL-compatible in all respects: inputs, outputs, 
clock line, and power supply. Table 2 compares a typi- 
cal p-channel silicon-gate 1,024-bit shift register 
(1404A) with the n-channel silicon-gate 2,048-bit re- 
circulating shift register (2401). A photomicrograph of 
the latter die is shown in Fig. 5. 
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Even though the n-channel device has twice as many 
bits as the p-channel version, it is only 20% larger in 
area. This is because the cell size of the 2401 is also 
nearly half the cell size of the 1404A, thanks primarily 
to the closer spacing allowed by the low voltage. Doubl- 
ing the bit capacity in each package results in consider- 
able savings for the user in printed-circuit board area, 
of course. The increase in bits per chip on 2401 will far 
outweigh the loss in yield on the slightly larger chip and 
result in a much lower cost per bit. 

Additionally the input levels are all referenced to 
ground, unlike the p-channel Mos levels that are refer- 
enced to the positive power supply voltage Vcc. Nearly 
50% of the user problems with p-channel MOs shift reg- 
isters are related to the clock levels. Incidentally. these 
are 16-V clock swings, and require either a voltage mul- 
tiplying stage between the 5-v TTL supply or a separate 
negative voltage supply. Since the 2401 has a TTL level 
clock and the clock capacitance is only 7 picofarads 


(worst case) over the full temperature range, as com- 
pared to over 100 pF for p-channel MOs clock inputs, 
system design is greatly simplified. 

The maximum power dissipation of the 2401 is less 
than the 1404A in system configurations. The 400-milli- 
watt worst-case power dissipation over the temperature 
range at maximum frequency of operation includes the 
clock generator power of the 2401. which is about 40% 
of the total. 

Since ease of use and TTL compatibility were empha- 
sized for this n-channel silicon gate design, the max- 
imum data rate of the 2401 is actuallv slower than that 
of the 1404A. This is due to the large load presented to 
the internal drivers. But even under these circumstances 
the device is capable of operating at 2 MHz. 

Finally. the 2401 saves the user some expense in ex- 
ternal parts. Many p-MOS shift registers must have sepa- 
rate circuits to recirculate the data and to address the 
bit lines. With the 2401, on the other hand, recircula- 
tion, and two chip selects for X-Y matrix selection are 
supplied on chip. Also included is an internal load re- 
sistor. removing the user’s need for external matching. 


5. Close fit. Unlike most p-channel memory devices, this 2,048-bit dynamic shift register made with n-channel technology is directly com- 
patible with TTL interface circuits in input, output, and clock lines; most significantly. it can operate from a single 5-V supply. 
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A Fully Decoded 2048-Bit Electrically Programmable 
FAMOS Read-Only Memory 


DOV FROHMAN-BENTCHKOWSKY, MeEmsBeEr, IEEE 


Abstract—-This paper describes a fully decoded 2048-bit electri- 
cally programmable read-only memory implemented with a novel 


floating-gate avalanche-injection MOS (FAMOS) charge-storage . 


device as the basic nonvolatile memory element. The memory is 
organized as 256 words of 8 bits, it is fully TTL compatible, and 
can be operated in both the static or dynamic mode. The memory 
afray was successfully fabricated with silicon gate MOS technology 
yielding functional devices with access times of 800 ns in the static 
mode and 500 ns in the dynamic mode of operation. The memory 
chip is assembled in a 24-lead dual-in-line package. 


are presently implemented in a variety of digital 

system and computer applications. Most available 
semiconductor ROMs are programmed permanently at 
the integrated-circuit fabrication stage by a custom mask 
that defines the desired information pattern. As a result, 
program changes in microprogramming applications as 
well as pattern changes during the debugging phase of 
digital systems involve the generation of a new mechan- 
ical mask for every modified ROM pattern. In addition 
to being an expensive step, it also limits the flexibility of 
ROM applications because of the delay involved in the 
production process. These limitations of mask program- 
mable ROMs have led to a growing interest in electrically 
programmable semiconductor ROMs in which the per- 
manent information pattern is recorded by application of 
an electrical signal. This allows programming changes to 
be affected without the expense and time delay involved 
in generation of a custom mask. 

The different proposed electrically programmable 
ROMs can be divided into two main categories: 1) ROMs 
in which a permanent (irreversible) change in the memory 
metal interconnection pattern is affected by an electrical 
pulse; and 2) alterable ROMs in which a reversible 
change in active memory device characteristics is in- 
duced electrically. 

The first category includes fusible-type ROMs, which 
are mainly bipolar memories with capacities of up to 512 
memory bits. Their main disadvantage is that they can- 
not be reprogrammed, which prevents complete functional 
testing before shipment, as well as pattern modification 
in case of programming error or needed change. 

The search for alterable (charge storage) ROMs stems 
from the need for low-cost fully tested field-program- 


cee ioe read-only memories (ROM) 
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mable ROMs as well as an attempt to provide a substi- : 
tute for the nonvolatile storage capability (retention of © 


stored information without an external power source) of - — 


magnetic memories. Most proposed alterable ROMs rely © 
on charge storage in a dielectric that forms part of the 
gate of an insulated-gate field-effect transistor. Feasibil- 
ity has been demonstrated for a metal-nitride-oxide-sili- 
con (MNOS) memory [1], [2] a metal-aluminum-oxide-:., 
silicon (MAS) memory [3], and a dual-gate MNOS 
memory [4]. Difficulties in controlling the electrical char- 
acteristics of the storage dielectrics and additional fabri- 
cation steps required to achieve on-the-chip decoding » 
have limited the realization of these approaches to un- 
decoded memory arrays of up to 256 memory bits. | 

Recently, feasibility of the ovonic amorphous semicon- — 
ductor memory device has been demonstrated by fabri- 
cation of an undecoded 256-bit memory array [5]. . 

The introduction of a novel MOS memory element— 
the floating-gate avalanche-injection MOS (FAMOS) 
charge-storage device—has led to the fabrication of a 
fully decoded 2048-bit electrically programmable ROM. 
The memory is organized as 256 words of 8 bits, it is 
fully TTL compatible and can be operated in both the 
static or dynamic decoding and sensing mode. The monoz. 
lithic memory array was successfully fabricated with 
silicon-gate MOS technology, yielding functional devices 
with access times below 800 ns in the static mode and 
less than 500 ns in the dynamic mode of operation. The 
memory chip is assembled in a 24-lead dual-in-line pack-_ 
age. 


I. Device STRUCTURE AND OPERATION 


A cross section of the FAMOS structure is shown in 
Fig. 1 with its suggested electrical symbol. It is essen-. 
tially a p-channel silicon-gate MOS field-effect transis- - 
tor [6] in which no electrical contact is made to the. 
silicon gate. The floating polysilicon gate is isolated from 
the silicon substrate by a SiO, layer of approximately 
1000 A and from the top surface by 1.0 u of vapor-depos-_ 
ited oxide. Operation of the FAMOS memory structure 
depends on charge transport to the floating gate by ava- 
lanche injection of electrons from either the source or 
drain p-n junctions. A junction voltage in excess of —30 
V applied to a p-channel FAMOS device (Fig. 2) with 
1000 A of oxide and 5-8-Q-cm substrate resistivity will 
result in the onset of injection of high-energy electrons 
from the p-n junction surface avalanche region to the 
floating silicon gate. The gate charging current is of the 
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Fig. 2. Cross section of FAMOS device under bias. 


order of 10-’ A/cm?. Since the silicon gate is floating the 
electron current through the oxide results in the accumu- 
lation of a negative charge on the gate. For a p-channel 
_ FAMOS transistor this negative charge will induce a 
~ eonductive inversion layer connecting source and drain. 
". The amount of charge transferred to the floating gate as 
“ @ function of the amplitude and duration of the applied 
- junction voltage is shown in Fig. 3. The presence or ab- 
sence of charge can be sensed by measuring the conduct- 
‘- ance between the source and drain regions. 
Once the applied junction voltage is removed, no dis- 
charge path is available for the accumulated electrons 
"since the gate is surrounded by thermal oxide, which is 
~"g very low conductivity dielectric. The electric field in 
the structure after the removal of junction voltage is due 
only to the accumulated electron charge and is not suffi- 
cient to cause charge transport across the polysilicon- 
-thermal-oxide energy barrier. The maximum stored charge 
of 4 X 10" electrons/cm? (Vw = 50 V, Fig. 3) results in 
‘an electric field of approximately 2 x 10° V/cm across 
the thermal oxide. Assuming current transport by Fowler- 
~ Nordheim emission from the polysilicon gate into the 
.. oxide, the estimated discharge current (for a polysilicon—- 
.- $102 energy barrier of at least 3.2 eV) is of the order of 
_10-*° A/cm? at 300°K. 
Charge decay plots as a function of time at 125° and 
. 800°C are shown in Fig. 4. The rapid initial decay sat- 
- urating with time cannot be explained by electron trans- 
port across the oxide. Its temperature and electric-field 
dependence correspond to that of positive-charge buildup 
at the SiO.—Si interface that occurs at high electric fields 
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Fig. 3. Charge accumulation on the floating gate as a function 
of charging pulsewidth for different values of pulse amplitude. 


under negative gate bias conditions [7], [8]. The activa- 
tion energy of charge decay was measured to be 1.0 eV. 
An extrapolation of the 300°C charge decay results in- 
dicates that 70 percent of the initial induced charge can 
be retained for as long as 10 years at 125°C. 

Since the gate electrode is not electrically accessible, 
the charge cannot be removed by an electrical pulse. 
However, the initial condition of no electronic charge on 
the gate can be restored by two nonelectrical methods. 
Illumination of the unpackaged device with ultraviolet 
light will result in the flow of a photocurrent from the 
floating gate back to the silicon substrate thereby dis- 
charging the gate to its initial condition. This erase 
method allows complete testing of a complex memory 
array before the package goes through final seal. Once 
the package is sealed, information can be erased by ex- 
posure to X-ray radiation. The radiation dose required 
is in excess of 5 X 10* rad, which is many orders of mag- 
nitude higher than the average yearly atmospheric radia- 
tion dose and easily attainable with available commer- 
cial X-ray generators. 


II. Memory CELL 


The FAMOS charge-storage device described above 
can be used as the basic storage element in a large mem- 
ory array. A circuit schematic of the memory cell and 
its associated decoding circuitry is shown in Fig. 5. The 
decode and sense circuits shown are common to both the 
PROGRAM and READ modes. To select a bit to be pro- 
gramed the address-decode inputs as well as the Vpp 
and Vgq lines are energized to —50 V. Programming of 
a memory bit is accomplished by coincidence selection 
of the X and Y select lines. The applied programming 
pulse is transferred to the selected FAMOS device that 
turns normally on due to the electron charge transferred 
to its floating gate. All other memory bits are not pro- 
grammed due to either the lack of a pulse on the Y select 
line or the absence of a transfer pulse on the X select 
line. The programming signal Vp is a —50-V 5.0-ms pulse. 
The amount of charge stored in a memory cell in re- 
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Fig. 4. Charge decay in a FAMOS device as a function of time 
at 125° and 300°C. 


Fig. 5. Memory cell with its associated decode and sense circuits. 


sponse to a programming pulse is typically 3.0 x 10-7 
C/cm?, which is equivalent to 10 V on the gate of a con- 
ventional MOS transistor. The load current required to 
program a memory bit is approximately 5.0 mA. The in- 
crease in programming pulse amplitude and duration to 
charge the memory cell compared to the data presented 
in Fig. 3 is due to the additional voltage drop across the 
decode circuits. These high applied voltages in the pro- 
GRAM mode could give rise to parasitic programming paths 
due to field inversions over the thick oxide regions on the 
chip. This potential problem was accounted for in both 
the circuit design and chip layout by providing a highly 
conductive programming path as compared to the para- 
sitic path as well as careful routing of high-voltage lines 
on the chip to avoid sensitive circuit nodes. 
Memory-cell operation in the READ mode is similar to 
that of other ROMs. A memory bit is selected by a coin- 
cidence of signals on the X and Y select lines. However, 
compared to the programming mode the READ mode volt- 
ages are substantially lower. Since the programming 
threshold is —30.0 V, the maximum reap voltage of —15 
V guarantees a wide margin to avoid disturbing the in- 
formation during the READ mode. Information in the se- 
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Fig. 6. Block diagram of memory organization. 


lected memory cell is sampled by the output sense circuit. 
If a “0” is stored in the cell (charge on the floating gate) 
the FAMOS device is on and the level at the input of 
the sense circuit is close to Vgc. A “1” corresponding to 
no charge stored in the cell is reflected by a more nega- 
tive level (close to Vpp) at the input of the sense circuit. 
Information can be decoded and sensed in either the static 
(no clocks) or dynamic mode (2 clocks). The static mode 
of operation eliminates the need for clocks at the expense 
of increased power dissipation and reduced speed, while 
the dynamic mode offers advantages in both performance 
categories. The option of the two modes of operation is 
achieved by parallel load transistors in the decode and 
sense circuitry, one connected to the clock lines and the 
other to Veg as shown in Fig. 5. The load device con- 
nected to the Vp terminal is connected to ¢, in the dy- 
namic READ mode. Mode selection is done by activating 
the clocks with Veg connected to Vee in the dynamic 
mode. Vgq is activated in the static mode with the clocks 
connected to Voc. 


IIT. Memory ORGANIZATION AND OPERATION 


The electrically programmable memory chip consists 
of a monolithic array of 2048 FAMOS devices organized 
as 256 words of 8 bits. A block diagram of memory or-. 
ganization is shown in Fig. 6. All circuit blocks are com- 
mon to both the ProcRAM and READ modes with the ex- 
ception of the procRAM data input buffers. In the proGRAM 
mode the eight output terminals are used as data inputs 
to determine the information pattern in the eight bits of 
each word, while word address selection is performed by 
the X and Y decoders through the input drivers. The 
PROGRAM data input circuitry for one of eight outputs is 
shown in Fig. 7. To inhibit the programming of a bit, a 
negative voltage is applied to the data input terminal. 
This voltage level is transferred to the inhibit transistor 
(chip select is enabled), which turns on and overrides 
the Y select signal. To allow programming of a selected 
bit, the data input terminal is kept at ground. Initially 
all 2048 bits are in the “1” state corresponding to nor- 
mally orr FAMOS devices (no charge on the floating 
gate). Information is introduced by selectively program- : 
ming “0” in the proper bit locations through charging the 
FAMOS devices from the procraM terminal. The supply 
(Vop, Vee, Vp), address (Ap — Ag) and data input (Dy; 
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Fig. 7. Circuit description of procrAM data input buffers. 


— Diya) voltages in the procRAM mode are detailed in 
Fig. 8. A timing diagram for the applied voltage sequence 
in the PROGRAM mode is shown in Fig. 9. In the READ 
mode the procRAM data input buffers are inhibited by the 
_chip select signal to cut off the feedback path from the 
output to the memory array established in the PROGRAM 
mode. Memory operation in the READ mode is the same 
as in conventional mask programmable ROMs. The in- 
put and output buffers provide for full TTL compatibil- 
ity and addressing is accomplished by the X and Y de- 
coders operating at the voltage levels detailed in Fig. 8. 
A selected memory cell with a charged FAMOS device 
will be reflected by a low “0” TTL level at the output, 
while a memory bit that is not charged will result in a 
high “1” TTL level. Both the static and dynamic modes 
of decoding and sensing are available in a single package 
through use of parallel load transistors as described in 
Section II. Mode selection is done by activation of either 
the clock lines (4;, ¢2) or the Veg terminal in the dy- 
namic and static modes, respectively. Typical access times 
are 400 ns in the dynamic mode and 700 ns in the static 
mode. A photomicrograph of the chip with the designa- 
tion of the different circuit blocks is shown in Fig. 10. 

- As can be seen from the above description, electrical 
‘programming of the memory is conceptually the same as 
operation in the READ mode with the exception of the 
voltage levels (Fig. 8). Hence, the memory can be easily 
programmed from punched paper tape or other data in- 
- put devices through an electrical programming terminal. 
Once programmed, the information pattern in the mem- 
ory can be erased (restored to the all “1” state) by ultra- 
violet light before packaging or by exposure to X-ray 
radiation as described in Section I. The ability to erase 
before packaging allows for complete testing of the mem- 
ory chip prior to shipment, which is a distinct advantage 
over once-programmable ROMs. Erasure of information 
in packaged devices by placing them in a commercially 
available X-ray generator allows for correction of pro- 
gramming errors as well as unpredictable future pattern 
modifications. 


IV. REvIABILITY 


One of the most important performance factors in an 
electrically programmable ROM (as in all other prod- 
ucts) is its long-term reliability. Reliability in-this case 
incorporates both the long-term functionality of the stan- 
dard MOS transistors and the long-term information re- 
tention of the FAMOS devices. Reliability data on sili- 
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Fig. 8. Applied voltage level to the memory in the procraM and 
READ modes of operation. 
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Fig. 9. Timing diagram for the procRAM mode. 


con-gate MOS memory products have been accumulated 
over a period of two years, resulting in many device 
hours without failure. This record is directly applicable 
to the programmable ROM since it is fabricated with 
the same silicon-gate MOS technology. As to long-term 
information retention, the charge-storage programmable 
ROM has a predictable time-temperature dependence for 
stored information. . 
The long-term decay (Fig. 3) has a logarithmic de- 
pendence on time with a slope of approximately 1.0 V 
per 5 decades of storage time at 125°C. This rate of 
charge decay extrapolates to storage retention times 
greater than 10 years at 125°C. The predictable time- 
temperature behavior of stored charge corresponding to 
an activation energy of 1.0 eV, allows for accelerated 
life testing of the programmable ROM. To guarantee 10- 
years retention at 125°C, the device has to be subjected 
to 300°C storage for only 10 h. This procedure is similar 
to existing high-temperature stress testing in MOS prod- 
ucts to detect potential contamination in the dielectric. 
A summary of reliability results on the 2048-bit elec- 
trically programmable ROM is shown in Fig. 11. The 
extrapolation of time from 125° and 200°C to the operat- 
ing temperature of 85°C is based on an activation energy 
for charge decay of 1.0 eV. The bias configuration in the 
temperature/bias test corresponds to worst case operat- 


‘ing voltages in the reap mode. No failures have been 
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Fig. 10. Photomicrograph of the memory chip with designation 
of the different circuit blocks. 
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Fig. 11. Reliability results for eee retention of the pro- 


grammable RO 


observed in any of the long-term information-retention 
tests. 

An additional reliability factor is the retention of the 
uncharged “1” memory state. It was pointed out in Sec- 
tion III that operation in the READ mode differs from the 
PROGRAM mode only in the lowering of the operating volt- 
ages from —50 to —14 V. This brings up the question of 
whether an uncharged memory bit can be slowly charged 
by repeated selection in the READ mode. Retention of the 
uncharged memory state as measured on a typical mem- 
ory cell is shown in Fig. 12. The time for a shift in 
FAMOS device turn-on voltage as a function of applied 
voltage to the memory cell is plotted for different values 
of turnon voltage shift. For an operating voltage in the 
READ mode of —14 V, a shift of Vp = 1.0 V is expected 
to take place after more than 100 years. A shift of at 
least 5.0 V is required before the uncharged memory 
state “1” is interpreted as a “O” by the sense circuitry. 
Another important aspect of reliability is the extent of 
preshipment testing. As pointed out in previous sections, 
because of its reprogrammable feature, the memory array 
can be completely tested before shipment for functional- 
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Fig. 12. Retention of the uncharged memory state as a function 


of applied voltage to the memory cell. 


ity and programming, as well as an accelerated high- 
temperature test to guarantee long-term retention. After 
the tests are performed, the information pattern is erased 
and the programmable ROM is shipped with a pattern of 
all “1” ready for programming. 


V. APPLICATIONS 


Despite the attempts of semiconductor manufacturers 
to lower mask charges and turn-around time for mask- 
programmable ROMs, the economics and design delay 
involved have limited the versatility of semiconductor 
ROM applications. The availability of a low-cost elec- 
trically field-programmable ROM opens new application 
areas, which heretofore have not been economically feasi- 
ble, mainly those in which relatively small quantities of 
different ROM patterns are required. For example, the 
implementation of computer lookup tables that provide 
a hard-wired means of performing certain routine com- 
puter calculations. The variety of possible patterns makes 
the use of mask-programmable ROMs costly and im- 
practical, while ideally suited for field-programmable 
ROMs. Other application areas that fall into the same 
category are binary-sequence generators, mitroprogram- 
ming control of central processors in computer terminals, 
and programmed logic arrays. 

On the other hand, in areas such as code converters, . 
character generators, and other applications in which 
large quantities of a few patterns are required the field 
programmable ROM augments the capability of mask 
programmable ROMs. In these applications, the initial 
custom mask cost is insignificant compared to the total 
cost of the high-volume standard ROM pattern. How- 
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ever, in order to finalize a decision on a given high- 
volume standard pattern a method of debugging is needed 
to avoid the costly phase of correcting potential errors. 
In this initial pattern definition phase the electrically 
programmable ROM is most economical and _ flexible. 
Since the flexibility of field-programmable ROMs is gen- 
erally achieved at the expense of chip area, once the pat- 
tern has been defined, the high-volume order is placed 
for mask-programmable ROMs, which are lower cost in 
high quantities. Hence the combination of the two pro- 
gramming methods offers both the flexibility and cost in- 
centive required to proliferate the use of semiconductor 
read-only memories in high-volume applications. 


VI. SUMMARY 


The introduction of a novel semiconductor memory 
element—the FAMOS charge storage device—and its im- 
plementation in a fully decoded 2048-bit electrically pro- 
grammable read-only memory constitutes a significant 
advance in the state of the art of semiconductor mem- 
ories. It is the first available large-capacity program- 
mable ROM in which the information pattern is recorded 
electrically by way of a reversible change in memory- 
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‘device characteristics. This was achieved by a com- 


bination of a new charge-storage structure and circuit 
techniques that allow its implementation in large-scale 
memory arrays with existing MOS processing techniques. 
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MERGE IN FOUR-CHIP PROCESSOR KIT 
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Processors of various degrees of complexity can be designed from 
three standardized chip elements and a fourth, a read-only memory, 
customized to each user’s specific purpose by his own microprogram 


CJ A vast new array of applications has opened up with 
the advent of low-cost minicomputers that the user can 
microprogram himself for any number of repetitive 
functions. These applications range from control of 
computer peripheral devices to hospital patient moni- 
tors to gaming machines. 

Such widespread use of microprogramed minicom- 
puters is made possible by large-scale integration that 
brings the cost about two orders of magnitude below the 
cost of the most inexpensive conventional mini- 
computer. One such computer, the Intel MCs-4, is com- 
posed of only four kinds of LsI chips. Three of these 
chip modules—a read/write memory, a processor, and a 
shift register—are standardized designs. The fourth 
module, a read-only memory, is programed to the user’s 
specifications. 

Among the potential applications for these micro- 
programed machines are, generically speaking, office 
. equipment, process control, and instrumentation. More 
specific examples abound in these and other categories: 
billing machines and point-of-sale terminals; numeri- 
cally controlled machines and traffic control, spectrum 
analyzers, navigational receivers, hospital patient moni- 
tors, intelligent terminals, and others. 

The MCS-4 consists of the 4001 read-only memory 
(ROM), which also contains a four-bit input/output sec- 
tion, or port; the 4002 read/write memory (RWM), 


which includes both main storage and status memory, - 
and a four-bit output port; the 4004 four-bit central 
processing unit (CPU); and the 4003 shift register (SR) 
for extending output functions—actually a medium- 
scale circuit because it holds only 10 bits and is on a rel- 
atively small chip. Input lines can be expanded with 
standard MSI multiplexers. 


Unexceptional chip size 


All employ p-channel silicon-gate MOS circuitry and 
have conventional measurements typical of today’s 
smaller LSI circuits—53 by 85 mils for the shift register 
and roughly 110 by 150 mils for the other three. Circuit 
density is quite high; each chip holds more than these 
measurements imply. The standard LSI circuits—the. - 
RWM, the SR, and the CPU—are produced in high vol- 
ume, and the only specialized part of the design—the 
ROM—uses the already proven economy of mask pro- 
graming. Because the customer can maintain the mask 
patterns for his ROMs as proprietary, he still has the ad- | 
vantages of a custom design, without the high cost that 
usually accompanies custom designs. | 

The 1/0 port of the ROM contains latches that store 
data being transferred from the computer’s data bus to 
the outside world—that is, the remainder of the system 
in which the computer is used. Should such external 
equipment require more than the number of output 


1. Read-only memory. This array of 256 eight-bit words can be used from one to 16 times in a Computer made from the 4-chip micro- 
cornputer set MCS-4. Chip also contains multiplexing and demultiplexing circuits to permit it to transmit and receive words through the pro- . 
cessor’s data bus, four bits at a time, and buffers for transferring data from the bus to the outside world. 
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2. Maximal. Up to 16 ROMs and 16 RWMs can be attached to one processor in the new chip set. Input and output are also handled through 
these chips; output can be expanded beyond four bits in parallel through shift registers. 


lines provided by the ROMs and RWMs, the SR makes 
more lines available, as described below. 

The status memory portion of the 4002 RWM is really 
only an extension of the main memory, differing from 
the latter primarily in its means of access. It is useful for 
storing labels, exponents, signs, and other control infor- 
mation associated with the data stored in the main 
memory. 

A typical computer configuration uses one CPU, to- 
gether with several RWMs and ROMs, and perhaps a few 
SRs. These configurations are flexible, economical, and 
easy to design.. 

Flexibility comes from easy program changes, ease of 
changing capacity, small size, and low power dissipa- 
tion. A short design cycle is possible because the system 
is controlled by a microprogram, which is inherently 
easier to design and implement than either random 
logic or custom LSI circuits. 

The system is economical to manufacture because all 
the chips come in standard 16-pin dual in-line packages 
that lend themselves to automatic insertion and enable 
the customer to implement rather complex functions 
with relatively few chips. Furthermore, because three of 
the four chips are standardized, the customer is not 
faced with a development charge for them, while mask 
charges for a new ROM are far less than development 
charges for custom LSI chips. Furthermore, even these 
mask changes can be saved, particularly during proto- 
type development, by using electrically alterable ROMs. 


Minimum system 


The smallest possible system would require one CPU 
and one ROM. The latter, which contains 256 words of 
eight bits each, as shown in Fig. 1, would hold programs 
and subroutines for the CPU, and perhaps some fixed 
data. The maximum amount of ROM that can be 
directly addressed is 16 chips or 4,096 bytes (see Fig. 2). 
While these 4,096 bytes require a 12-bit binary address, 
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the ROM communicates with the CPU over a four-bit- 
wide data path. Therefore the ROM contains, in addition 
to the usual decoding circuits, a demultiplexer to permit 
the address to come in three four-bit chunks and a mul- 
tiplexer to divide the output word into two more four- 
bit chunks. 

The cpu delivers the three address chunks in three 
cycles. The first two of these select one byte in each ROM 
chip, while the third selects one chip out of all the ROMs, 
which delivers its byte to the data bus. Following these 
three address cycles are two more cycles during which 
the CPU accepts the eight bits from the ROM--four dur- 
ing each cycle. 

The ROM chip itself contains the logic circuitry re- 
quired to keep track of which data transfer is in process 
at any time, so that it can react to each one in the 
proper way. 


Read/write memory 


Each RWM (Fig. 3) contains 80 characters of four bits 
each, organized as four 20-character registers. In each 
register, 16 characters are part of the main storage, and 
the other four characters belong to the status memory 
previously mentioned. Therefore, the 16 chips in the 
largest memory configuration contain 64 registers, or 
1,024 characters of main storage and 256 characters of 
status memory. 

The CPU controls both the RWM and the ROM chips 
through six control lines. One of the six synchronizes 
the clocks of all chips to that of the CPU. The other five, 
called command lines, control the chip addressing and 
the input/output function. These six control lines per- 
mit the single family of chips to be used in a wide vari- 
ety of system configurations, suitable for many appli- 
cations. 

If more than the 16 ROMs and 16 RWMs are needed 
for a particular application, external interface circuits 
and bank switching techniques similar to those used in 
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3. Read-write memory. Four 16-character data registers and four 
4-character status registers appear in the rectangular array at left 
center in this photo. Remainder of chip holds control and timing cir- 
Cuits, and output buffers similar to those in the ROM. 


some minicomputers permit the necessary number of 
additional memory chips of either type to be connected. 

The sr, a 10-bit static shift register into which data is 
loaded serially, has output lines that are accessible in 
parallel. Logic included on the chip disables the output 
lines until all desired data has been shifted into it from 
a ROM or RWM. 

The CPU is a small but complete processor (Fig. 4) ca- 
pable of executing 45 different instructions in a basic in- 
struction cycle of 10.67 microseconds. It is built around 
a four-bit adder, and has a set of 16 four-bit scratchpad 
registers. Internally, the CPU is strictly binary; but for 
applications using decimal arithmetic, the binary result 
of adding two binary-coded decimal digits can be re- 
turned to BCD by a special instruction. 

Of particular interest is the set of four 12-bit address 
registers that permit as many as three levels of subrou- 
tine nesting. During the execution of a program, when a 
jump to a subroutine is encountered, the instruction ad- 
dress register contains the address of the next instruc- 
tion to be performed after the subroutine is executed. 


Bigger and better 


Intel’s new microcomputer set, as described in the ac- . 
companying article by Ted Hoff and Federico Faggin, 
is actually only one of several microcomputers now 
appearing on the scene. Recently, Intel also an- 
nounced its 8008 microcomputer, an eight-bit CPU on 
one 125-by-170-mil chip [Electronics, March 13, 
p.143]}. Like the 4004, the 8008 requires external 
memory and interface chips to make a full computer, 
which, in turn, would be only one part of a larger sys- 
tem in most applications. However, it does have an in- 
ternal scratchpad memory of seven words, plus an in- 
struction counter to use with the external memory. 

‘But the eight-bit chip doesn’t make the four-bit ob- 
solete,’’ insists Hoff. ‘For example, the 4004 is more 
economical; a functioning computer can be made with 
it and only one additional memory chip." But the min- 
imum system using the 8008, he says, would require 
15 or 20 packages and would be a correspondingly 
more powerful system. 

—W.B.R. 


| 


Its contents must be temporarily stored to permit the 
main program to resume following the subroutine. In 
the 4004 CPU, the subroutine may itself branch to a sec- 
ond subroutine, and that one to a third; three of the 
four address registers store three address words for this 
nesting process, while the fourth holds the address of 
the current instruction. 


Extended capability 


For a complex system, requiring more extensive pro- 
cessing capability, multiple processors can be used. In 
such a system with two central processing units, one 
CPU could be dedicated to control functions and the 
other to arithmetic functions. They could communicate 
with each other through the I/O ports, or they could 
share a common memory through a minimum of exter- 
nal circuitry. 

Each 10.67-us instruction cycle is divided into eight 
machine cycles of 1.33 ps each, as shown in Fig. 5. Dur- 
ing the first three of these, an address is sent out to the 
ROM. Data from the ROM is returned to the CPU during 
the next two cycles and is processed during the last 
three. Some instructions require an additional 10.67-ps 
cycle to fetch eight more bits from the ROM. 

The individual machine cycles are driven by a two- 
phase clock running at 750 kHz. 


Steps in design 


All four LsI chips may operate with a single -15 volt 
supply voltage with respect to ground; or they may be 
made compatible with transistor-transistor logic by op- 
erating them between -10 volts and +5 volts. 
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4. Processor. Heart of the four-chip set, this central processing unit is a small but complete general-purpose computer that can execute 45 
instructions. The four-bit adder, its central component, is at left center, as Key diagram shows. Address registers at right are important fea- 
tures. Processor works.exclusively in binary, but has a special instruction that translates a pure binary sum to decimal form. Four white lines 
around left, bottom, and right are bus interconnecting major sections on chip. 


‘For a particular application, designing a system 
around the MCS-4 is quite simple. There are essentially 
seven steps: 

® Define the input/output requirements in terms of the 
peripheral equipment that will be needed. 
~@ Define the amount of storage needed, and from that 
determine the number of RWM chips. 
@ Define the amount of control and/or program, and 
-. thus the amount of read-only storage needed; this de- 
termines the number of ROM chips. (This step may in- 
volve an iterative procedure.) 

= Specify shift registers as needed to increase the capac- 
ity of output lines. 

@® Write the program. 

= Build a prototype system, implement the program 
and controls in electrically programable ROMs, and get 
the bugs out. 

# Submit the program for manufacturing mask-pro- 
' gramed ROMs for volume production. 

In defining the input/output requirements, the first 

_ step in the design sequence, a number of soft- 
. ware/hardware alternatives are available. For example, 
if the system includes a keyboard, it will interface with 


the computer’s data bus through either a ROM or a 
RWM. This interface may be capable of climinating 
bounce from the key depressions, encoding the data, 
and rejecting the results of a multiple key depression; or 
all these functions may be implemented in software and 
executed in the CPU. 


Keyboard design 


An approach that uses little hardware and a moder- 
ate amount of software, shown in Fig. 6, places each key 
at the intersection of one input line of a ROM port and 
one output line of another ROM or RWM port. The pro- 
gram continually scans the keyboard by sequentially 
placing a single binary | on each output line; if any key 
has been depressed, that | reappears on the correspond- 
ing input line. The particular combination of output 
and input lines establishes the key’s identity, and may 
be translated by a code conversion routine in the CPU 
into a command or character in conventional ASCII or 
other code. By requiring an encoded character to ap- 


_pear on several successive scan cycles, the CPU obtains 


key debouncing. With other suitable programs the CPU 
can detect multiple keying or key rollover. 


5. Eight-In-one. Instruction cycle of 10.67 microseconds comprises eight 1.33-u4s machine cycles in 4004 CPU. First two cycles transmit ad- 
dress of desired word in ROM, third selects one of up to 16 ROMs. Then the addressed word returns to the CPU in two cycles; instruction is 
executed in remaining three cycles, plus another full eight if needed. Clock frequency is 750 kHz. 
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A larger keyboard may be more conveniently 
scanned by the 4003 sr, as shown in Fig. 7. Here only 
two output port lines are used. One of the lines is pro- 
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6. Keyboard interface. For data entry to system, the MCS-4 scans 
all columns on keyboard; signal reappearing on a row input shows 
that a key has been depressed, and timing of signal relative to scan 


identities the key. Program then encodes the signal, and also elimi- 
nates bounce and multiple keying. 
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7. More hardware, less software. For larger keyboards, the 4003 
shift register extends the scan while simplifying its operation. 
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8. Still more hardware. Direct approach requires external encoding 
and debouncing. and works with minimum support from program. 


gramed to provide a clock pulse for the shift register, 
and the other supplies data. A single binary 1 is loaded 
into one end of the register and shifted along it; the par- 
allel outputs effectively scan the large keyboard, and 
appear at the input port just as with the previous — 
example. 

A third design that uses the most hardware and very 
little software, shown in Fig. 8. requires an external 
diode encoder and debouncing circuits. In this arrange- 
ment the key depressions are read directly into the com- 
puter without any scanning. 

The amount of read-write storage depends on the 
number of characters to be stored at one time and the 
number of bits that are to be stored in read/write stor- 
age as opposed to ROM. 

For example, if a particular computer is to work with _ 
16-digit numbers and may have to store 10 such num- — 
bers at any one time, these numbers will occupy the 
main-storage registers in three RWM packages, leaving. 
two spare registers in one of the packages. These three 
packages also contain status memory with a capacity of © 
48 characters or 144 bits. - 


Calculating ROM needs 


In most applications using the MCS-4 chip family, 
normal programs are stored in the ROM. However, when | 
the user desires to load programs at execution time, it is 


open to him to program the MCS-4 to operate in inter- wer Ds 


pretive mode. 


In this mode, the program in the ROM fetches “data” 


from RWM and interprets it as new instructions for the 
CPU, jumping as required to subroutines kept in the 
ROM. Interpretive mode programs may also run with 
pseudo-instructions kept in ROM; such programs often 
us¢ ROM more efficiently than conventionally written 
programs. In either case, the designer planning to use 
interpretive mode must allow space for the programs 
when defining his memory requirements. — 

Determining the amount of read-only storage is a 
good deal more difficult. It depends on the system com- 
plexity and sophistication. The ability to make an edu- 
cated guess early in the design process largely depends 
on experience. 

With the number of ROM and RWM chips established, 
another look at 1/0 requirements may be in order if the 
number of 1/0 lines provided by these packages is sub- 
stantially different from that assumed in the first step of 
the design process. But if the 1/O layout is firm, the 
number of lines can be increased if necessary through 
the use of SRs. 

After development of the program that the computer 
will execute, the next step is to simulate the program be- 
fore committing it to mask tooling for the production of 
ROMs. The suggested approach is to use programable 
ROMs, instead of the 4001, at the prototype stage. When 
all the bugs are out of the system in the simulation, the 
truth tables for the program can provide the data for 
mass-produced mask-programed ROMs. 

The MCS-4 is just a beginning. More complex cpus 
are being designed [see “Bigger and better,” p. 114], and 
will be common before long. Some of them will be 
faster, and some of them will be extremely flexible, 
thanks to the use of programable ROMs. C] 
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THE MCS-4—AN LSI 
MICRO COMPUTER SYSTEM 


by F. FAGGIN, M. SHIMA*, M.E. HOFF, H. FEENEY, S. MAZOR 
INTEL CORP. 3065 BOWERS AVE. SANTA CLARA, CALIFORNIA 95051 
*BUSICOM CORPORATION, TOKYO, JAPAN 


Reprinted from IEEE 1972 Region Six Conference; 
copyright 1972 by the Institute of Electrical and 
Electronic Engineers, Inc. 


ABSTRACT 


The MCS-4 is a totally self-contained four-bit 
general purpose microprogrammable computer in 
component form. It consists of four basic ele- 
ments: the CPU (central processing unit), the 
ROM (read only memory), the RAM (random ac- 
cess memory), and the SR (shift register). They 
are fabricated by MOS silicon gate technology 

' and packaged in economical sixteen pin DIPs_ to 
minimize board area and reduce system cost. 


Using combinations of these standard building 
blocks, any degree of customization may be 
built into these powerful microprogrammed four- 
bit computers. Using as few as two devices, a 
CPU and a ROM, a four-bit microprogrammed ded 
icated computer may be built for under $50. 


This paper describes this new micro-computer set, 
highlighting the system partitioning and the basic 
CPU hardware instruction set. 


INTRODUCTION 


LSI technology provides new components for sys- 
tem design -- microcomputers. With the MCS-4, 
a four-bit LSI microcomputer set fabricated with 
the p-channel silicon gate MOS process, the 
power of a general purpose computer is available 
to every system designer as an alternative to 
conventional designs of random logic systems. 
The MCS-4 can provide the same control and 
computing functions of a minicomputer in as few 
as two sixteen pin DIPs and costs nearly two 
orders of magnitude less. 


This set of components is not designed to com- 
pete with the minicomputer, but rather to extend 
the concept of the dedicated computer into new 
applications where the minimization of cost and 
size are very important, but where speed is not 
a mojor factor. 


_A major goal in the development of the MCS-4 
was to devise a computer architecture allowing 
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system partitioning into a minimum number of 
sixteen pin packages. The result is a set of 
standard LSI building blocks which are manu- 
facturable in high volume and are flexible 
enough to be used in a variety of applications. 


The partitioning resulted in four MOS micro- 
computer building blocks: 


CPU -- four-bit parallel processor 
ROM + I/O -- 256 words x eight bits / 
four I/O lines metal mask 


programmable ROM 


RAM + output - 80 four-bit characters / four 
output lines 


SR -- ten-bit shift register (serial in, serial 
out, parallel out) 


ADDITIONAL 
RAM 


CONTROL 


“0 
PORT 


CLEAR 
1/0 LINES SYNC RESET V0 OUTPUTS 


SERIAL 
OUTPUT 


EXPANDED OUTPUTS 


Figure 1. MCS-4™- BASIC SYSTEM 


Time multiplexing was used extensively to re- 
duce the pin count and minimize circuit area. 
The MCS-4 is a totally self-contained system; 
no additional interface components are necessary. 


The circuits operate with a Single supply voltage 
of -15v and two non-overlapping clock phases, 
G1 and Bo. 


The heart of each system is a Single chip CPU 
‘which performs all the control and data process- 
ing functions. Directly interfacing with the CPU 
are ROMs which store microprograms and data 
tables and RAMs which store data and pseudo 
instructions. The MCS-4 communicates with 
peripheral devices through I/O "ports" provided 
on each ROM and RAM chip. In addition, ten- 
bit parallel shift registers can expand the I/O 
capability of the system, 


Figure 1 shows the basic MCS-4 system. All 
address data and instruction communication is 
carried out on the four-bit nondedicated data bus; 
System synchronization and memory control are 
provided by the CPU, 


The basic system timing for a typical instruction - 


is shown in Figure 2. A total of eight clock 
periods is required for the addressing, fetching, 
and execution of a single word instruction. An 
instruction cycle is completed in 10.8 us when 
operating at a clock frequency of 750 khz. In 
a typical operating sequence, the CPU sends out 
a twelve-bit address in three successive four-bit 
bytes during the clock times Aj, A2, and A3. 


Address Sent to ROM From CPU ———» 


Data 
Bus 
Contents 


Middle 4-bit 
Address to 
ROM's ROM’s (Chip 


Select Code) 


OPR to CPU 


y] 

This address is received by the ROMs, and an 
eight-bit instruction word is selected (one of 
4096 instructions). The eight-bit instruction is 
sent back to the data bus in two four-bit bytes 
during the following two cycles, M1 and M2. 
During the final three clock cycles, X1, XQ, 
and X3, the instruction is interpreted and exe- 
cuted. 


THE CPU (SYSTEM CONTROL) 


This single chip CPU is the fundamental com- | 
ponent of the system. It provides the complete , 
memory addressing, instruction interpretation, 
and control for the total system. The CPU con-” 
tains the following functional blocks (refer to 
Figure 3): 


Address Register and Address Incrementer 


The address register is a dynamic RAM » 
array of four twelve-bit words. One 
word is used to store the effective ad- 
dress (program counter) and the other 
three words are used as a stack for 
Subroutine calls. Thus nesting up to 
three levels is possible. The program 
counter is incremented on each in~ 
struction cycle as the address is sent 
out from the CPU. With the twelve-bit 
address the CPU can directly address up 
to sixteen ROMs, each containing 256 
eight-bit words. 


INSTRUCTION CYCLE 


Instruction Sent to 
CPU From ROM 


1.35 us —a 


Execution of Instruction —————_»- 
Data is Operated on in the CPU, Or 
Data or Address is Sent to/from the CPU : 


If IR" The 
Selected 4001 
Or 4002 Are 
Enabled, Other- 
wise The.CPU 
ts Enabled 


The CPU 
Is Enabled 


Data or Address 
to RAM’s and 
ROM’s If 10!) 
Or SRC!2) 

Data to CPU 

if OR’) 


OPA Out Address to 
RAMs If 


SRCi2) 


Snstruction to CPU 


OPA to CPU 
and ROM's 
and RAM’s 
if IO") 


(Not Used) 


{1} 10 instructions control the flow of information between accumulator in CPU, I/O tines in ROM’s and RAM’s and RAM storage. TOR stands for IO Read. in this 
case the CPU will receive data from RAM storage locations or L/O input lines of 4001's. 


(2) The SRC instruction designates the chip number and address for a following IO instruction. 


Figure 2, MCS-4™- BASIC INSTRUCTION CYCLE 
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ui cm cM CM CM 
Yoo VYec RAM, RAM, RAM, RAM, 
oO 


CM-RAM 
OUTPUT BUFFERS 


ADDRESS 
INCREMENTER 
CM-ROM COMMAND Ci 
OUTPUT CONTROL 


BUFFER REGISTER AMPLI. & MULTIPLEXER 


ADDRESS 
REGISTER 
(PROGRAM 
COUNTER 


a & & STACK) 
ACCUMULATOR mu 4x 12 BIT 
ND CARRY F/F OPA OPR DYNAMIC 
RAM 
MUX 
& SHIFTER 
REFRESH 
COUNTER 
EFFECTIVE 
INSTRUCTION REGISTER ESS 
oP. OPR 
ADB 
BUFFER REG. 
REFRESH 
COUNTER & 
DYNAMIC RAM 


INTERNAL DATA BUS 


Figure 3. 4004 CPU BLOCK DIAGRAM 


Index Register In addition to the basic functional blocks, in- 
ternal timing, ROM and RAM enable control, and 
Sixteen four-bit general purpose registers data bus I/O control are included in the peri- 
are provided for use as scratch pad regis- pheral circuitry. 
ters or memory pointers. These registers 
may be accessed individually or in eight- The instruction repertoire is permanently stored 
bit register pairs. in an associative memory in the CPU (the in- 
struction decoder). It consists of three basic 
Four-bit Arithmetic Unit groups of instructions: 
This is a four-bit parallel adder with rip- Sixteen machine instructions (basic control 
ple through carry. All arithmetic instruc- of the address and index registers) 
tions are executed using this functional 
block. Since this system operates on Fifteen I/O and RAM instructions (commun- 
four-bit bytes of data, it can be used ication with peripheral devices through 
directly for computation in BCD. The the ROM and RAM I/O ports and character 
result of an operation between two BCD storage in RAM memory) 
numbers is computed in binary; using 
the decimal adjust accumulator (DAA) in- Fourteen accumulator group instructions 
struction, the result of the operation and (basic accumulator processing instructions) 


the carry bit are converted back to BCD. 
The complete instruction set including the 
Instruction Register and Decoder mnemonic, binary code, and description is pre- 
sented in Figure 4. 
Eight-bit instructions are received, stored, 
and decoded to generate control signals 
for all other functional blocks. 


6-34 


Figure 4. MCS-4™- INSTRUCTION SET 


[Those instructions preceded by an asterisk (*) are 2 word instructions that occupy 2 successive locations in ROM] 
MACHINE INSTRUCTIONS : 


DESCRIPTION OF OPERATION 


Fetch immediate (direct) from ROM Data D2, 0; to index register pair 
tocation RRR_(2) 


Send register control. Send the address (contents of index register pair RRR} 
to ROM and RAM at X2 and X3 time in the Instruction Cycle. 


Fetch indirect from ROM. Send contents of index register pair location 0 
out as an address, Data fetched is placed into register pair jocation RRR 


Jume indirect. Send contents of register pair RAR out as an address 
at Ay and Az time in the Instruction Cycle. 
Jump unconditions! to ROM address A3, Az, Ay. 


Jump to subroutine ROM address Az, Az, Ay, save old address. (Up 1 level 
in stack.) 


Increment contents of register RRRR, (3) 

Increment contents of register RRAA, Go to ROM atidress Az, Ay 
{within the seme ROM that contains this ISZ instruction) if result # QO, 
otherwise skip (go to the next instruction in sequence). 


Add contents of register RRRA to accumulator with carry. 
Subtract contents of register RRRR to accumulator with borrow, 


Load contents of register RRRR to accumulator. 


INPUT/OUTPUT AND RAM INSTRUCTIONS 


(The RAM's and ROM'‘s operated on in the 1/O and RAM instructions have been Previously selected by the last SRC instruction executed.) 


DESCRIPTION OF OPERATION 


Write the contents of the accumulator into the Previously selected 
0 o 00 RAM main memory character. 
Write the contents of the accumulator into the previously selected 
| ooo1 | RAM output port. (Output Lines) 
Write the contents of the accumulator into the previously selected : 
ROM output port. (1/0 Lines) 


Write the contents of the accumulator into the Previously selected 

RAM status character 0. : 
Write the contents of the accumulator into the previously selected 

RAM status character 1. 


Write the contents of the accumulator into the previously selected 
RAM status character 2. 
Write the contents of the accumulator into the previously selected 
RAM status character 3. 
Subtract the previously selected RAM main memory character from 
accumulator with borrow. 


Read the previously selected RAM main memory character 
into the accumulator. 
101 1 


Read the contents of the previously selected ROM input port 
into the accumulator. (1/0 Lines) 


Add the previously selected RAM main memory character to 
accumulator with carry. 


Read the previously selected RAM status character 0 into accumulator. 


Read the previously selected RAM status character 1 into accumulator. 


Read the previously selected RAM status character 2 into accumulator. 


Read the previously selected RAM status character 3 into accumulator. 


Complement accumulator. 


i. Oko | Rotate jeft. {Accumulator and carry) 


Rotate right. (Accumulator and carry) 


Transmit carry to accumulator and clear carry. 


Decrement accumulator 


Transfer carry subtract and clear carry. 


Set carry. 


Decimal adjust accumulator. 


Keyboard process. Converts the contents of the accumulator from a 
one out of four code to a binary code. 


Designate command fine. 
— oe 


notes (1 The condition code is assigned as follows 


Cy = 1 Invert jump condition G = 1 Jump if accumulator is zero C4 = 1 Jump if test signa! is a 0 
C, = 0 Not invert jump condition C3 = 1 Jump if carry/link is a1 

arr is the address of 1 of 8 index register pairs in the CPU. 

Srarr 1s the address of 1 of 16 index registers in the CPU. 


(Geach RAM chip has 4 registers, each with twenty 4-bit characters subdivided into 16 main memory characters and 4 status characters. 
Chip number, RAM register and main memory character are addressed by an SRC instruction. For the selected chip and register, however, 
status character locations are selected by the instruction code (OPA). 
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THE ROM (CONTROL MEMORY AND I/O) 


This device performs two very distinct but inde- 
pendent functions in the system, storage of the 
instruction sequence, and input/output functions 
for communication with peripheral devices. The 
four input/output lines are provided on the ROM 
rather than the CPU to reduce the individual 
package pin count. Inputs or outputs may be 
custom selected to individual system require- 
ments at the same time that the metal mask ROM 
program is prepared. As more ROMs are added 
to a system, the number of I/O ports is also in- 
creased. 


The ROM memory array is organized as 256 eight- 
bit words. When a particular ROM word is ad- 
dressed by the CPU, the address is stored in a 
register included in the ROM and the eight-bit 
ROM word is multiplexed into two four-bit bytes 
and sent to the system data bus. Since the CPU 
can directly address up to sixteen ROMs, a bi- 
nary code for each ROM must also be programmed 
in the metal mask. 


The ability to program both the memory and the 
1/O ports provides custom "personality" of the 
system along with the economic advantage of 


0)0,D2D: 2 
SYNC RESET 04203 ~CM-RAM3 


using high-volume standard LSI devices. 


THE RAM (DATA STORAGE AND OUTPUTS) 


The RAM also provides two distinct and indepen- 
dent functions, data and pseudo-instruction stor- 
age and output communication with external 
peripheral devices. 


The RAM is organized as four registers, each 
containing sixteen four-bit main memory charac- 
ters and four four-bit status characters. If the 
system is operating with decimal arithmetic (all 
decimal numbers represented in BCD), each re- 
gister in the RAM can Store a complete sixteen 
digit decimal number. The status characters 
provide additional storage for the sign, decimal 
point position, exponent of the number, or other 
control information. RAM and 1/O line address- 
ing is accomplished in a rather unique way. The 
five CPU command lines (CM-ROM, CM-RAMj) 
control the way in which ROM and RAM chips 
interpret the data on the data bus. 


RAMs can be arranged in four banks of four chips 
each, each bank controlled by a separate CM-RAMj 


line (refer to Figure 5). 


SERIAL 


Figure 5. MCS-4™- SYSTEM INTERCONNECTION 
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To operate on an arbitrary location in RAM, the 
following instruction sequence is required: 


1, The command line must be designated (DCL 
instruction) 


2. The chip, register, and character must be 
selected (SRC instruction) 


3. The operation is performed on the selected 
character (RDM, WRM, ADM, instructions), 
If an I/O instruction is executed (WMP) the 
content of the CPU accumulator is latched 
on the output port. 


Operations on the I/O port of the ROM chips are 
accomplished in a similar manner. 


THE SR_(1/O EXPANDER) 


To increase the number of output lines for peri- 
pheral communication, the SR chip (MSI function) 
was added to the set. 


This is a ten-bit serial-in, parallel-out, serial- 
out, static shift register that directly interfaces 
with the I/O ports provided on ROM and RAM chips. 


CONCLUSION 


A system using this set of devices wil] usually 
consist of one CPU and from one to sixteen ROMs, 
up to sixteen RAMs and an arbitrary number of 
SRs. A minimum system could be designed with 
just one CPU and one ROM, 


Using microcomputers, changes in system func- 
tion can be easily implemented by changing the 
ROM program rather than by costly alteration of 
random logic hardware. The MCS-4 offers tre- 
mendous flexibility of design and allows the user 
to have many of the desirable features of a cus- 
tom MOS LSI design, small package count, a set 
of components uniquely his own (each user's pro- 
grams are his proprietary property), and yet none 
of the disadvantages of a long development cycle 
and high development cost associated with custom 
LSI design. The short design cycle and flexibi- 
lity associated with ROM programming allows 
much more rapid response to market demands than 
is possible with custom LSI, providing insurance 
against obsolescence, 


The important features of the MCS-4 family are 
summarized in the following table: 


Pour-bit parallel CPU with 45 instructions 


Instruction set includes conditional branching, 
jump to subroutine, and indirect fetching 


Nesting of subroutines up to three levels 
Sixteen four-bit general purpose registers 


Decimal and binary arithmetic modes 


Synchronous operation with memories 
Direct compatibility with ROM, RAM, and SR 
Directly drives up to: 


- 4096 eight-bit words of ROM (sixteen chips) 
- 1280 four-bit RAM characters (sixteen chips) 
- 128 I/O lines (without SR) 

~ Unlimited I/O (with SR) 


Memory capacity expandable through bank 
switching. 


Two-phase dynamic operation 

Single power supply (Vpp = -15 volts) 

10.8 us instruction cycle 

Addition of two eight-digit numbers in 850 Ms 
P-channel silicon gate MOS 

Sixteen-pin DIP package 

Minimum system: one CPU and one ROM 


To add even more flexibility and further accele- 
rate the design cycle, the CPU and RAMs may ¢ 
be interfaced with conventional electrically pro- 
grammable and erasable ROMs. This will allow 
fast program development and provide a viable | 
approach to build few-of-a-kind systems. 


The microcomputer is already Playing an impor- 
tant role in today's system designs. 
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CONSIDERATIONS FOR THE USE OF MICRO 
COMPUTERS IN SMALL SYSTEM DESIGN 


by M.E. HOFF — MANAGER, APPLICATIONS RESEARCH 
INTEL CORP. 3065 BOWERS AVE. SANTA CLARA, CALIFORNIA 95051 


SUMMARY 


Microprocessors consisting of a very few 
LSI Circuits are a new tool for the electron- 
ics designer. While they require the 
designer to learn to write programs in addi- 
tion to performing electronic hardware design, 
the advantages offered by smal | systems using 
these microprocessors often make the effort 
worthwhile. ‘An actual example of such a 
design, a unit for encoding data into field- 
~ programmable ROM's, is discussed. 


Integrated circuit technology has made 
dramatic advances since the development of 
the first monolithic circuits in the early - 

1960's, This technology has proven particu- 
larly appropriate for manufacturing circuit 
elements for digital systems. Initially 
single gates were implemented on each silicon 
die, but the complexity of these circuits 
rapidly grew as more complicated functions 
such as flip-flops, multi-bit counters and 
shift registers and ultimately memory arrays 
were developed. MOS processes have al lowed 
even higher component densities than the 
early bipolar processes and now allow suf- 
ficient function to permit the central 
pracessor of a simple computer to be produced 
on a single chip of silicon. With this 
technology, a complete digital computer can 
be implemented with very few circuit packages. 


Already the minicomputer has, because 
of its small size and low cost, produced a 
revolution in systems design. As the new 
single-chip computers cost an order of 
magnitude less than the typical minicomputer, 
an even greater revolution in the use of 
digital processing is likely. 


The availability of such extremely low 
cost computers makes possible a number of 
new approaches to the design of such smal | 
systems as test instruments, computer termi- 
nals and controllers for computer peripherals. 


The use of these "microcomputers" rather than 


more conventional logic in such designs offers 
a number of advantages. Typically, most of 
the "personality" of the system using a 
microcomputer is determined by the program 

for the microcomputer rather than by wiring. 
This program is typically stored in read-only 


Reprinted from 1972 Wescon Technical Papers, Session 26; 
copyright 1972, Western Electronics Show and Convention, 
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memory. As a result the characteristics of 
the system can be radically changed by 
changing the content of the read-only memory. 
This feature allows the designer to make 
radical changes in equipment character istics 
far more easily than when conventional design 
techniques are used. 


Nevertheless, there are certain disad- 
vantages associated with the use of microcom- 
puters for a system design. Perhaps the most 
important disadvantage is the necessity for 
the designer to be capable of efficiently. 
programming the microcomputers as well as 
considering their electrical requirements. 
This programming must usually be done ina 
more difficult microcomputer machine language 
or assembly language rather than the higher 
level |anguages offered with large computers. 
Furthermore, the debugging of these programs 
in prototype equipment can be more difficult 
than debugging conventional hardware, To 
offset these difficulties, a number of tools 
may be used. For example, assembler and 
simulator programs which operate on larger, 
more powerful computers may be used to 
simplify the program development and debugging 
stages. The assembler allows the programmer 
to write in a symbolic assembly !anguage 
rather than the direct machine language of the 
microprocessor. The simulator allows the user 
to observe the operation of a program as it is 
executed by simulating this execution on the 
larger computer. With these techniques, the 
development and debugging of microcomputer 
programs is greatly simp! ified, and need be 
no more difficult than a conventional hardware 
design. 


To better illustrate the uses for these 
microprocessors, consider the following design 
problem for which a microprocessor provided an 
ideal solution: 


Intel Corporation is a manufacturer 
of semiconductor memory products; 
included in the product line are a 
number of field-programmable read- 
only memory (ROM) devices. One 
family of these devices includes 

the |701, a 2048-bit sil icon-gate 
MOS ROM which can be both programmed 
and erased in the field. A second 


programmable ROM product is the 
5601, a 1024-bit bipolar ROM which 
utilizes a blown-silicon fuse 
technology for its implementation. 
To support these f iel d~programmab | e 
read-only memory products at 
factory sales offices and dis- 
tributors throughout the country, 
it is necessary for Intel to 
provide a ROM-programmer unit. 

The basic mode of operation for 
the ROMprogrammer should be to 
program new read-only memories 

from a paper tape or a previously 
programmed device. This ROM 
programmer should not only be 
capable of handling existing 
product categories, but should 

also be capable of handling» 

future read-only memory developments. 


Because new and quite different program- 
mable read-only memory products might be de- 
veloped in the future, the use of a micro- 
computer to build the ROM programmer seemed 
ideal. As a result, the Intel MCS-4 set was 
chosen as the basis for the design. 


The MCS-4 is a family of four integrated 
circuits which may be used to build a vari- 
ety of microprocessor configurations. The 
four members of the family include the 4004 
central processor, the 4001 mask-programmed 
read-only memory, the 4002 read-write memory 
and the 4003 10-bit shift register. The 
4003 is used primarily as an output expander. 
The central processor implements some forty- 
five instructions, each of which is executed 
in either 10.8 or 21.6 microseconds. In- 
structions typically manipulate four or 
eight bits of data. The microprocessor 
family communicates with outside. devices by 
means of 4-line input-output "ports" 
associated with each read-only memory and 
read-write memory. Read-write memory ports 
are output-only but read-only memory ports 
may be chosen for input or output at the 
time that the read-only memory program mask 
is taped. 


To make the hardware check out phase of 
an MCS-4 system more convenient, it is 
customary to interface 170! programmable 
read-only memory to the MCS-4 central 
processor as a substitute for the 400! mask 
programmed read-only memory. Using this 
interface, programs can be quickly encoded 
into the ROM, checked out and corrected 
without the tedious and expensive procedure 
of having the programs committed to masks. 
While it is customary for production systems 
to use the 4001 mask-programmed ROMs because 
of their economy, the decision was made to 
continue to use the 1701 type of read-only 


memory for the production ROM programmer units 
to make future equipment revisions more 
convenient. 


Figure | shows a photograph of the ROM 
programmer which was designated the 7600C. 
The mechanical construction of the unit is 
quite conventional. The main chassis is 
divided into two parts with the power suppl ies 
and paper tape reader located on the left 
Side and a printed-circuit card cage located 
on the right. The card cage holds some tive 
printed circuit boards with provision made 
for an additional interface board. Five basic 
boards are used in this system: a power supply 
board which contains the power supply regula- 
tors and over-voltage protection for the main 
system; an interface board which shifts vol tage 
levels as necessary for ROM's being programmed: 
and three boards which make up the computer 
system itself. Of these three boards, one 
contains the central processor, memory and 
clock generator circuits. The second contains 
the read-only memory array for the computer 
program and the third provides data bus 
buffering and circuitry to simulate the ports 
associated with the 4001's (that are being 
replaced by I70I!s). In addition to those 
boards in the card-cage, there is also a 
display panel board which includes an adapter 
for the device test sockets, as well as an 
address display, two data displays and an 
eight-bit status display. 


The front console of the unit includes a 
power switch and a tape winder switch, both 
of which are independent of the computer — 
system, an eight position mode selector 
Switch, and six push-button momentary contact 
switches. Although the mode switch and the 
push-button switches are labelled, these 
switches are merely inputs read by the . 
processor so their functions are determined 
exclusively by the program in the processor's 
read-only memory. Similarly, while the 
address, data and status displays are labelled, 
these displays are determined exclusively by 
the program. The control of the tape reader 
and the interpretation of the data read from 
the tape is also determined by the micro- 
processor program. Initial versions of the 
microprocessor program accept the tapes only 
in Intel's standard I60! format although a 
number of alternate formats have been proposed. 
Two mode switch positions have been reserved 
on the front panel for loading tape. These 
are presently labelled "load TELEX" and "load 
TWX" for five-wide and eight-wide tapes 
respectively. However, if the two widths are 
not required, one of these positions might be 
used to handle an alternate tape format or 
the microprocessor might be programmed to recog= 
nize One or more special characters which would 
signal the use of an alternate format for the 
tape data. 
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Figure 2 shows a block diagram of the 
microcomputer system and its interface to the 
device under test, system displays, and the 
panel controls. The 4004 central processor is 
wired directly to an array of sixteen 4002 
read-write memory units. In addition, a buffer 
circuit is provided so that the 4004 can 
fetch its program from the array of 1I70I 
programmable read-only memories. Output sig- 
nals from the 4004 are delivered at the 
output ports of the 4002 read-write memories. 
The buffer circuit logic also includes pro- 
visions for a number of input ports. These 
input ports are used to read the status of the 
front-panel switches and to read signals from 
the device under test. The interface for 
signals to the device under test and the dis- 
* play interface both use the 4005 output 


a expander. The multiplexing inherent in the 
“use of the 4003 reduces the number of wires 


_which must be routed within the chassis. For 


“> example, although the display includes 32 
ve. individual Light emitting diodes and four 


' digits of seven segment display, only twelve 
signals leads run from the microprocessor to 
the display. Not all timing signals for the 
~ device under test are generated by 4002 
memory ports, because the 4004 executes 
instructions no faster than one per 10.8 
microseconds. Where faster timing signals 
must be generated for the device under test, 
these are done on the interface board for the 
device. As was described above, this inter- 
face board also includes level shifters and 
any special regulators required to power the 
‘device under test. It should be noted that 
“while the interface board does implement some 


«of the critical timing functions for testing 


the device, most of the timing functions are 
‘implemented directly under computer control. 
For example, when programming |70I's, the 
control of the power supply and program pulses 
are done by the microprocessor. 


At the time of this writing, the program 
implements eight modes of operation. Two of 
the modes are used to load teletype tapes into 
the (4002) buffer memory; one mode for five 
track TELEX tape and the other mode for eight 
track tapes in ASCII format. A third mode 
permits the loading of the buffer memory from 
a device in the testsocket. (This operation 
is much more rapid than that of loading paper 
tape). A fourth mode programs the test device 
and the fifth mode is used to read and check 
the test device. The sixth and seventh modes 
are intended for use with an external teletype. 
They allow listing the contents of the memory 
or the contents of the test device. If the 
teletype used has a paper tape punch, either 
of these modes can produce a tape which is 
readable in TWX mode. The last mode provides 
for manually altering the contents of the 
buffer memory and/or manually programming the 
device plugged into the test socket. 
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The six momentary contact push-buttons 
are used to control operation within a mode. 
Three of them are active in all modes of 
operation. They are the reset, stop/increment, 
and start/continue buttons. Three other 


push=-buttons labelled "I" and "0" and "write" 
are used primarily in manual mode. Pressing 
the "0" or "I" buttons shifts the corresponding 


datum into a data input buffer, while the 
write push-button causes this data to be 

programmed into the test device and to be 
deposited into the internal buffer memory. 


The program which implements these eight 
modes of operation with the full push-button 
contro!, for the two programmable ROMs 
occupies eight 1701 ROMs or about 2,000 bytes 
of read-only memory. The read-only memory 
card for the microprocessor has provision for 
up to sixteen I70!'s, f.e., at this state of 
the program approximately one-half of the 
capacity of the system is used. 


The use of a microprocessor has had some 
interesting effects on the 7600C system, In 
a few cases, a change which would have meant 
moving only one or two wires in conventional 
hardware has required several hours for 
reassembly. (Assembly of programs is done on 
an in-house PDP-IO timesharing system at 
Intel and requires approximately 2 to 3 
minutes of computer time. Editing changes 
required for minor alterations usually takes 
but a few minutes to perform, However, once 
the assembly is complete, tapes for reprogram- 
ming the 1701's must be generated and these 
programmable ROMs must have the new data 
encoded into them.) However, many changes 
which would have represented several days of 
rewiring and extensive alterations to the 
printed circuit layout have taken only the 
same few hours to implement. For example, 
an early revision of the manual mode of 
operation did not display the original contents 
of the buffer memory. Adding this display 
capability modified only two locations of one 
of the 170! ROMs. Total turnaround time for 
this change was perhaps one hour. Had it been 
necessary to perform this change in hardware, 
it would have required extensive wiring changes 
within the chassis and the alteration of one 
or more of the printed circuit boards. Another 
feature that was added by very minor change 
in the program was the ability to set the 
memory to all I's or to all O's. This was 
implemented as follows: 


With the ROM programmer in the load DUT 
mode, pressing the start button still loads 
the contents of the test device into the 
buffer memory. However, if the | or the O 
button is pressed, the entire buffer memory 
will be loaded to that value. 


The turnaround times mentioned for these 
changes refer to the time elapsed from the 
"design" of the change to its first imp | emen- 
tation. Once the change is debugged, any 
field model of the ROM programmer could be 
changed merely by replacing one or more of the 
eight 1701 ROM's on the ROM-printed circuit 
board. 


While this model of the ROM programmer 
uses the I70I for all of its program memory, 
other systems might use combinations of 400! 
mask-programmed ROM and 1701 field- 
programmable ROM for greater economy. 
technique is to utilize the subroutine 
capability of the MCS-4 by "hardening" many 
of the subroutines into conventional ROM 
while leaving the mainline program in 
programmable ROM, 


One 


CONCLUSIONS 


The use of a microprocessor in a piece 
of test equipment for use with field- 
programmable ROMs simplified the intérnal . 
design of the unit and greatly increased the 
flexibility and capability for future modi-~ 
fications. The turnaround time for changes 
in the prototype are usually on the order of 
a few hours, relatively independent of the 
complexity of the changes designed. However, 
when such a change is made, units in the field 
may be altered by merely replacing the ROMs 
containing the program for the microcomputer. 
Extensive use has been made of the subroutine 
capability of the MCS-4 microcomputer set. 

As a result, most changes usually involve only 
One or two ROM chips. The use of these 


“products does require design approaches but 
the advantages far outweigh the disadvantages 
associated with learning these new techniques. 


Fig. |. Photograph of ROM Programmer Using MCS~4 
R/W Memory 
Displays 16 
4002s 
From ROM-Port Data-Bus 
Console Simulator Buffer 
Switches i. 
To 
Device Interface 1701 Array 
Under Circuits (Control Program) 
Test 
Figa 2. 


Block Diagram of ROM Programmer using MCS-4 
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MICRO COMPUTER APPLICATIONS OF 
ELECTRICALLY ALTERABLE ROMs 


by H. FEENEY 
INTEL CORP. 3065 BOWERS AVE. SANTA CLARA, CALIFORNIA 95051 


SUMMARY 


Micro computer systems (combina- 
tion of LSI micro processors, program 
storage, ROM, and data storage, RAM) 
provide a new tool for the system de- 
signer. At the present time, both mask 
charges and turn-around time for mask 
programmable ROMs are a barrier to rapid 
system development. The electrically 
programmable and ultra violet erasable 
ROM providesthe solution to this prob- 
lem. This paper describes the applica- 
tion of alterable read-only-memories to 
micro computer systems. 


INTRODUCTION 


In 1971, a fully-decoded 2048 bit 
electrically programmable and ultra vio- 
let erasable read only memory was intro- 
duced by Intel Corporation. This memory 
used a novel floating-gate avalanche- 
injection MOS (FAMOS) charge stored 
device as the basic memory element. 
Later, during the same year, Intel intro- 
duced LSI micro processors. Now, com-~ 
plete micro computer systems can be de- 
veloped which offer system flexibility, 
design expediency and manufacturing eco- 
nomy. It is in the areas of both flexi- 
bility and expediency where the electri- 
cally programmable ROMs (PROMs) play an 
important part. 


FAMOS MEMORY 


The FAMOS programmable and erasable 
memory was first explained by Dr. D. 
Frohman-Bentchkowsky at the 1971 IEEE 
International Solid State Circuits 
Conference: 


"The operation of the FAMOS mem- 
ory structure depends on charge 
transport to the floating gate 
by avalanche injection of elec- 
trons. Charge can be transfer- 
red to the floating silicon gate 
if an avalanche injection condi- 
tion is reached in either the 
source or drain junction...In a 
P-channgl FAMOS transistor with 
a 1000 A thick oxide, an applied 
junction voltage of -30v is re- 
quired for the onset of avalanche 
injection. The gate charging 


current is of the order of 107! 


A/om”. Since the silicon gate 
is flaating, the avalanche in- 
jected current results in the 
accumulation of a negative 
electron charge on the gate. 
For a P-channel FAMOS transis- 
tor, this negative charge will 
induce a conductive inversion 
layer connecting source and 
drain. The amount of charge 
transferred to the floating 
gate is a function of the am- 
plitude and duration of the 
applied junction voltage. 

Once the applied junction vol- 
tage is removed, no discharge 
path is available for the ac- 
cumulated electrons since the 
gate is surrounded by silicon 
oxide which is a very low con- 
ductivity dielectric...Since 
the gate electrode is not elec- 
trically accessible, the charge 
cannot be removed by an elec- 
trical pulse. However, the 


initial equilibrium condition 
(no electronic charge on the 
gate) can be restored by illum- 
inating the unpackaged device 
with ultra violet light or by 
exposure of the packaged device 
to X-ray radiation." 


FLOATING S! GATE DEPLETION REGION 


N-TYPE Si 


Figure 1. Cross-section of a Floating- 
gate Avalanche injection MOS (FAMOS) 
device. 


When this PROM is used as a system 


development tool, the ability to erase 
the memory is of primary importance. 


The FAMOS memory may be erased and com- 


pletely reprogrammed in less than one 
half hour. Figure 2 (next page) shows 
the simple erasing procedure using a 
high intensity short-wave ultra violet 
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light. The FAMOS memory is neither a4f- 
fected by sunlight nor by flourescent 
lighting. 


Figure 2. Erasing of FAMOS PROM 
(Quartz Lid Package). 


MICRO COMPUTERS 


LSI technology now provides addi- 
tional new components for the system 
designer -- micro processors. These are 
complete computer central processing 
units which include an arithmetic unit, 
index register storage, program counter, 
address stack, instruction decoding, and 
system timing and control on a single 
Silicon substrate. When combined with 
RAMs for data storage, complete micro 
computer systems can be assembled. 

These components provide the same 
flexibility as the minicomputer, but 
they extend the concept of the dedicated 
control computer into applications where 
the minimization of cost and physical 
size are important, but where processing 
speed is not a major factor. 

The power of a general purpose com- 
puting system is now available to every 
system designer as an alternative to the 
conventional design of random logic sys- 
tems. Since micro computer systems are 
custom tailored by programming of ROMs, 
they,are also an alternative to custom 
LSI. 

Using as few as two micro computer 
components (CPU and ROM), many of the 
same control and computing functions 
of the minicomputer are available at a 
cost of nearly two orders of magnitude 
less. 

Two micro processors ,currently 
available are the 4004,~'’° a four-bit 


parallel CPU designed for control and 
decimal arithmetic applications, and the 
8008,” an eight-bit parallel processor 
for data handling. The four-bit CPU 
(4004) interfaces directly with special 
ROMs (mask programmable 4001) and spe- 
cial RAMs (4002). On the other hand, 
the eight-bit CPU (8008) may be used 
with standard ROMs (mask programmable 
1301) and standard RAMs (1101, 1103), 
but it requires some TTL interface cir- 
cuitry. 

In both instances, the final sys- 
tems will be formed by the appropriate 
combination of standard LSI building 
blocks. For the system development 
phase, electrically programmable and 
ultra violet erasable ROMs should be 
used instead of the metal mask program- 
mable ROMs. 


MICRO COMPUTER SYSTEM DEVELOPMENT 


System development is always a 
tedious, iterative task. The use of 
micro computers moves much of the design 
from hardware to firmware. Most itera- 
tions in design then become ROM program 
modifications, not component and board 
changes. 
ters have been designed and ready for 
production in less than three months. 

A flowchart for the development e 
of a typical system is shown in Figure 3. 
(see next page). The general complexity _ 
of a system will be established by both 
the I/O requirement for data and the 
number of I/O lines required for’ peri- 
pheral controls. The RAM data storage 
requirement must also be determined, 
and sample programs must be written to 
determine ROM program storage require- 
ments. At this time, the basic hardware 
is fixed and it can be developed concur- 
rent with the firmware. 

When the system control programs 
are complete, they can be assembled — 
using a general purpose computer, and . 
tapes for the actual programming of FAMOS 
PROMs can be generated. After the PROMs 
have been programmed, the system can then 
be debugged. Subsequent iterations of 
the system program may be required. 

After the final iteration, the 
program can be committed to metal mask 
ROMs for the high volume production run. 
Concurrently, systems using PROMS can 
be built for initial deliveries. In the 
case where only low volume or one-of-a- 
kind customized systems are required, 
the system can be delivered using PROMs. 


6-43 


Many systems using micro compu- : 


DEVELOPMENT OF A MICRO COMPUTER SYSTEM 


Determine 1/O 
Requirement 
Determine RAM 
Requirement 


Write Sample Programs: 
Determine ROM Requirement 


Develop System Program 
7 Using FAMOS PROMs 
Program 


Operational ? 


High 
Volume 
Production? 


Order Metal 
Masked ROMs 
for High Volume 

Production 


Customize 
Individual Systems 
_ Using 
FAMOS PROMs 


Build Pre-production 
Systems Using 
FAMOS PROMs 


Figure 3. 


A typical prototyping system is 
shown in Figure 4. This board uses the 
four-bit CPU (4004) and associated RAM 
memory (4002). FAMOS PROMs and TTL 
interface logic is used to simulate the 
mask programmed ROMs (4001). This sys- 
tem forms a complete micro computer. 

In addition to its use as a prototype 
system, this board may be used to pro- 
vide the control for a PROM programming 
system. Using three special control 
ROMs, this micro computer can read data 
from a teletype, examine data format, 
present addresses and data to a FAMOS 
PROM programmer, and activate the pro- 
gram sequence (Figure 5). A similar 
micro computer prototype system is used 
with the eight-bit micro processor. 
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Figure 4. 
PROMs. 


Micro Computer System using 


CONTROL PROGRAM 


| i | | 


SIM@01 OR SIME O2 


ROM PROGRAMMING BOARD 


MP? Ot OR MP7 02 


i 
PROM SOCKET 


ROM SOCKETS if 
Try 
ASR 33 


PROM Programming System. 


Figure 5. 


RELIABILITY 


One of the most important perfor- 
mance factors in using both electrically 
programmable ROMs and micro computers is 
long term reliability. Both devices are 
fabricated using Intel's standard 
P-channel silicon gate technology of 
proven reliability. The only additional 
question posed by the use of the FAMOS 
PROM is the stability of the "1" memory 
state and the stability of the "0" mem- 
ory state. The extrapolation of the 


charge decay results based on Dr. 
Frohman-Bentchkowsky's initial measure- 
ments” indicate that 70% of the inital 
induced charge ig retained in excess of 
ten years at 125°C. 8 

Subsequent Intel testing of dis- 
crete devices, 16-bit memory elements, 
and the fully decoded 2048-bit memories 
confirms the reliability of this device. 
The worst case condition for the "0" 
state occurs when the device is subject- 
ed to high temperature storage at 125°C. 
There have been no failures in over 1.5 
million unit hours (67 million bit 
hours) of testing. The worst case con- 
dition for the "1" state occurs when 
voltage is applied to the charge-storage 
device. There have been no failures in 
over 1.7 million unit hours (3.4 billion 
bit hours) of life testing. 

FAMOS memory devices have also been 
exposed to sunlight and to flourescent 
light to determine the effect on data 
retention. In neither case was there 
a loss of stored data. 


CONCLUSION 


PROMS are currently being used in 
hundreds of micro computer develgpment 
programs. Systems in production or 
undergoing field testing which use PROMs 
for system control are in the areas of 
process control, inventory control, data 
acquisition, point of sale terminals, 
intelligent terminals, and many more. 

Much of the success and acceptance 
of the micro computer system is due to 
the availability of the electrically 
_ programmable and ultra violet erasable 
PROMs. 
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IMPACT OF LSI TECHNOLOGY ON SLIDE RULES, CALCULATORS, AND MINICOMPUTERS 


IMPACT OF LSI ON MICRO COMPUTER 
AND CALCULATOR CHIPS 


by H. SMITH 
INTEL CORP. 3065 BOWERS AVE. SANTA CLARA, CALIFORNIA 95051 


I. INTRODUCTION 


The complexity of the integrated 
circuit has been doubling every year, 
' yeaching a level today where it is 
possible to integrate complete com- 
puter central processors and calcu- 
lators on a single chip. The avail- 
ability of these tiny, low cost chips 


. is already producing a revolution in 


system design and rapidly changing 
the architecture of many systems. 


These LSI micro computer and calcu- 
-lator chips are classified as stan- 
dard random logic arrays which do 

- mot have the regularity of LSI mem- 
ory chips (where a single cell is 
repeated a number of times to form 
an array). This poses some unique 
design problems in both the area of 
product definition and product real- 
-dzation. First, the chip must be 
“defined and partitioned in such a way 
that it will be sufficiently univer- 
sal to have wide usage; and secondly, 
the chip size must be kept small 
enough to be able to produce it in 
large volume economically. Ina ran- 
dom logic array, the chip size will 
be more a function of the method of 
partitioning and the amount of inter- 
connection, rather than of cell size 
(as is the case with LSI memory). 


The Si-gate MOS process has had a 
‘tremendous impact on the design of 


-- yandom logic chips because it allows 


the high component and interconnect 


- @ensities that are required to eco- 


nomically produce these chips. With 
this technology, a complete central 
processor has already been put ona 


Reprinted from NEREM 72 Record; copyright 1972 by the 
Boston Section of the J/EEE. 
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single chip, and a complete computer 
has been implemented with a few LSI 
packages. 


This paper will briefly discuss the 
Si-gate MOS process and the impact 
that it is having on the high volume 
production of low cost, random logic 
LSI chips. It will then look speci- 
fically at several micro computer and 
calculator chips that are being pro- 
duced using this process. These chips 
will be compared with equivalent chips 
that have been manufactured with the 
metal-gate process. 


II. MOS SILICON-GATE TECHNOLOGY 


Briefly, the process steps are as 
follows: 

1. The starting Material is N-type 
silicon for P-channel or P-type for 
N-channel. 

2. The wafer is first placed in an 
oxidizing atmosphere at high temper- 
ature and a relatively thick layer 
(about 14Mm) of Sid. is grown on the 
surface. 
3. The region for the source, drain 
and channel is then defined by photo 
masking and etching. 

4. The wafer is again placed in an 
pia atmosphere and a thin layer 
of SiO, (.l1am) is formed. This thin 
layer éf g SiO.,will serve as the gate 
dielectric. 

5. Next, a thin layer of polycrys- 
talline silicon is deposited over the 
entire wafer. | 
6. The wafer is then returned to 
photo masking for removal of the sili- 
con layer except where the gate re- 
gions are defined or where the sili- 


con film will be used as an intercon- 
nection. The thin oxide is then re- 
moved by exposing the wafer to an 
oxide etch. 

7. The wafer is then placed in a dif- 
fusion furnace where boron (for P- 
channel) or phosphorous (for N-chan- 
nel) is diffused into the gate, inter- 
connect, source and drain regions. 

8. Next, a thick layer of oxide is 
deposited over the entire wafer and 
openings are etched for contacts be- 
tween the subsequent metalization and 
underlying diffused regions or the 
polycrystalline silicon interconnect 
level. 

9. Aluminum is then evaporated over 
the entire surface and is etched to 
define the metal pattern. 

10. The process is then completed 
with the coating of a layer of glass 
over the entire chip and the etching 
of the contact holes for the external 
‘interconnect. 


The silicon-gate process has many in- 
herent advantages over the convention- 
al metal-gate MOS process. These ad- 
vantages have made possible the fab- 
rication of low cost random logic LSI 
devices. The most dramatic evidence 
of this has been the introduction of 
several computer-on-a-chip devices 

by Intel in late 1971 and early 1972. 
The advantages of the Si-gate process 
are: 

l. Smaller and faster devices result- 
ing from (a) the se aligned gate, 
and (b) elimination of metal-to-metal 
separations between the gate and the 
source or drain. 

2. Circuit interconnection flexibil- 
ity resulting from the use of depo- 
Sited polycrystalline for intercon- 
nection with additional use of dif- 
fused conductors in the substrate, as 
in other MOS technologies. This gives 
flexibility approaching that of three 
layers for interconnecting complex 
functions efficiently. These three 
interconnections are illustrated in a 
magnified view of a portion of the 
8008 8-bit CPU chip shown in Fig. l. 
THE AREA SAVED OVER CONVENTIONAL METAL 
GATE MOS TECHNOLOGY IS APPROXIMATELY 
508. 


The impact of this Si-gate LSI techno- 


logy on the production of low cost 
micro computers and calculator chips 


Nae? ok 


iy @c- 


ERE AETE 
Or a Pg OH 


x Near ed 


Figure 1. Photomicrograph of a small 
region of Intel's 8008 8-bit CPU chip 
showing the three possible modes of | 
interconnection: (a) diffused regions 
within the wafer, (b) deposited sili- 
con, and (c) metal interconnecting 
lines. 


will now be illustrated. 


III. USING THE SI-GATE PROCESS TO 
BUILD MICRO COMPUTER CHIPS 


In November, 1971, Intel ushered in 

a new era of integrated electronics 
by introducing its first micro com- 
puter on a chip -- the 4004 4-bit 
CPU. In May, 1972, Intel introduced 
its second micro computer on a chip 
-~- the 8008 8-bit CPU. Both of these 
devices are fabricated and made eco- 
nomically possible with the P-channel 
Si-gate process. The announcement of 
these chips brought the power of a 
general purpose computer to every . 
systems designer as an alternative to 
conventional designs of random and 
custom logic systems. 
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4004 4-Bit CPU Chi 
The 4004 is the heart of the MCS-4 


general purpose, micro programmable 
computer set. When used with other 
members of the micro computer set 
(4001 -- 256x8 bit ROM and 4-bit I/0 
port, 4002 -- 320 bit RAM and 4-bit 
output port, and 4003 -- I/O expand- 
er), an almost unlimited number of 
types of systems may be built. 


The 4004 CPU chip consists of a 4-bit 
adder and accumulator, a 64-bit index 
register, a memory stack containing 
a 12-bit program counter and three 


Oo Udebit words used to store subroutine 


addresses, an address incrementer, an 
. B-bit instruction register and decod- 
. er, and control logic. (The block 
diagram is shown in Figure 2.) The 
characteristics of the chip are: 
Chip Size -- 117x159 mils, approx. 
. . 18,500 square mils 
Number of transistors -- 2248 
Package -- 16 pin DIP 


cm CM CM CM 
Yoo Vcc RAMy RAM, RAM, RAM; 
9 9 9 9 


CM-RAM 
OUTPUT SUFFERS 
COMMAND 
CONTROL 
REGISTER 


Figure 2. Block Diagram of Intel 
. 100d Ibit CPU Chip. 
8008 8-Bit CPU Chi 
The 60006 CPU, when combined with stan- 


dard memory devices, forms the MCS-8, 
general purpose micro computer set. 


“The 8008 CPU chip consists of an 8- 
bit parallel binary arithmetic unit 


-. and accumulator, six 8-bit data re- 


gisters, two 8-bit temporary regis- 
ters, four flag flip-flops, a memory 
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stack containing a 14-bit program 
counter and seven 14-bit words used 
to store subroutine addresses, in- 
struction register and decoder, and 
control logic. (The block diagram 
is shown in Figure 3.) The charac- 
teristics of the chip are: 

Chip Size -- 124x173 mils, approx. 

21,400 square mils 
Number of transistors -- 3098 
Package -- 18 pin DIP 


If we compare this CPU chip with an 
equivalent CPU chip that performs 
exactly the same functions and is 
fabricated in the metal-gate process, 
the advantages of the Si-gate LSI 
process can be clearly seen. 


(Mils) (Sq. Mils) 
124 x 173 |21,400 
215 x 225 148,500 


THE METAL-GATE CHIP IS MORE THAN 2.2 
TIMES THE SIZE OF THE SI-GATE CHIP. 


Figure 3. Block Diagram of Intel 8008 
-Bit CPU Chip. 


IV. USING THE SI-GATE PROCESS TO 
BUILD CALCULATOR CHIPS 


The last example that will be used to 
illustrate the impact of the Si-gate 
MOS process on random logic LSI chips 
is the single chip calculator. Intel 
has developed and has in production 


a single chip calculator that is be- 
ing fabricated with the P-channel Si- 
gate process. The functions and fea- 

tures of the chip are: 


Operations performed: addition, 
sibtraction= multiplication, divi- 
sion, chain multiplication, chain 
division, constant multiplication, 
constant division, percentage 
calculation 

Display: 10 or 12 digits 
Features: floating point, over- 
flow indication, credit balance, 
zero suppression 


The size of this chip is 132 x 128 
mils, or approximately 17,000 square 
mils. A comparison with equivalent 


single chip calculators is shown 
below. 


CHIP CHIP 
AREA 


Mils) (Sq. Mils) 


ae 
230x230] 53,000 
180x180; 32,400 


THE CONVENTIONAL METAL-GATE CHIP IS 

3 TIMES THE SIZE OF THE SI-GATE 

CHIP AND THE ION-IMPLANTED METAL-GATE 
-CHIP IS NEARLY 2 TIMES THE SIZE OF 
THE SI-GATE CHIP. 


digit floating 
point) 


ompany T single 
hip calculator 
(conventional 

etal-gate) (8- 
digit floating 


ompany M single 
hip calculator 
(Ion implanted 
etal-gate) (10 
digit floating 


V. CONCLUSION 


Si-gate LSI has made possible the 
production of micro computers and 
calculators on a single chip. The 
availability of these tiny, low cost 
computer chips is producing a revolu- 
tion in system design similar to that 
produced by the mini computer. The 
range and applications for the micro 
computer are very broad and the ar- 
chitecture of many systems is being 


changed to make effective use of 


these devices. Already, a large num- 
ber of cash registers, point of sale. 
terminals, traffic control systems, 
small accounting machines, process 
controllers, intelligent typewriters, 
digital scales, digital instruments, 
test equipment, etc. have been built 
using these new processors. 


The future will bring higher speed 
and more sophisticated LSI computer 
chips and we will see an even greater 
proliferation in the use of these 
devices. It is not unreasonable to 
project that computer chips will be 
routinely used in future design as 
TTL MSI is used today. The impact 

on system design will be truly phe- 
nomenal. 
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THE NEW LSI COMPONENTS 


by M. E. HOFF — MANAGER, APPLICATIONS RESEARCH 
INTEL CORP. 3065 BOWERS AVE. SANTA CLARA, CALIFORNIA 95051 


For several years, many authors have discussed 
techniques for partitioning logic designs to make 
them compatible for large scale integration. In 
the meantime, the complexity of the most econom- 
ically optimum integrated circuit has been 
doubling every year. With this rapidly growing 
complexity, it has now become possible to 
integrate many systems completely on a single 
chip. A large number of single-chip control 
circuits and single-chip desk calculators are 
already in service. 


The high complexity of the modern integrated 
circuit results in high development costs and 
raises the question of how to define the circuit 
in such a way that it will be sufficiently 
universal to have a wide usage. Wide usage is 
necessary or the development cost,. even when 
amortized over the total number of devices pro- 
duced, may make the circuit uneconomical when 
compared with more conventional logic circuits. 
Unfortunately, not all circuits have the wide 
usage or universal ity of function that is charac- 
teristic of the desk calculator. However, the 
high complexity available with modern integrated 
circuits makes possible the construction of a 
relatively universal component, the general 
purpose digital computer. At the present state 
of the technology, a computer central processor 
somewhat like that of a minicomputer can be 
integrated on a single silicon chip. While the 
performance of these computers is not yet equal 
to that of the minicomputer constructed with more 
conventional design, there are many appl ications 

“where the capabilities of these single-chip com- 
puters are quite adequate. 


The availability of these tiny, low cost 
microprocessors can be expected to produce a 
revolution in system design greater than that 
produced by the minicomputer, With the advent of 
the minicomputer, it became possible to use a 
digital computer as a component in larger systems. 
Using a minicomputer as the heart of a system 
‘offers a degree of flexibility that no ordinary 
hard-wired system could have, Because of the 
still significant size and cost of the typical 
minicomputer, however, its use as a component has 
been limited to relatively large and expensive 
systems, The new LSI microprocessors, however, 
remove most of the cost and size restrictions 
associated with the use of a digital computer as 
the heart of a system, 


The range and application for the new com 
ponents is very broad. Already, a large number 
of cash registers, point of sale terminals, 
computer terminals, traffic control systems, smal| 
accounting machines, intelligent typewriters, 
process controljiers, numerical machine tool con- 
trollers, etc., have been built using the new 
microprocessors. In some cases, the use of these 
microprocessors greatly simplifies the overhead 
associated with a design. For example, many | 
organizations were already designing simplified 


low-performance minicomputers for use in smal 
systems after having recognized the advantages 
of a programmable system rather than a hard-wired 
one. However, any processor design has a signi- 
ficant cost associated with the development of 
the software for the system. Minimum requirements 
for the support software usually include an 
assembler to permit programming in a symbol ic 
language and a simulator to allow programs to be 
checked out on a larger computer. In some cases, 
a microprocessor has been chosen because a wide 
variety of functions can be implemented with a 
single unit of hardware. Modifying equipment for 
a given customer's needs is also easier with the 
programmed structure. 


Many users of microprocessors have chosen 
the approach because the system is simplified as 
well as increased in flexibility. While not 
always true, there are many functions which are 
quite difficult to realize with hardware, but are 
quite easily implemented with a microprocessor. 
However, it should be noted that some functions 
which are fairly easy to perform with hardware 
can be more difficult to perform with microproces- 
sor. The most common difficulties arise when the 
designer converts from an essentially parallel 
hardware network to the sequential microprocessor 
imp!ementation. 


A potential difficulty faced by the manufac- 
turer who would use a microprocessor to implement 
his system is caused by the requirement for 
programming. Although programming experience is 
much more common among engineers today than it 
was a few years ago, a significant nunber of 
engineers are still unfamiliar with programming 
techniques, particularly in machine or assembly 
language. For the available microprocessors, 
however, the instruction sets are relatively 
simple. As a result, they are easily !earned and 
quickly remembered. Some engineers have found it 
helpful to think of the instructions as building 
blocks of the system much as MS! elements are 
building blocks of a hard-wired logic system, Most 
of the designers who investigate these micro- 
processors feel it wel! worth it to make this 
investment. for not only does the microprocessor 
make it possible to build @ more powerful, more 
capable system but it also offers a number of 
other advantages. For example, because so much 
of the system's characteristics are determined by 
the program for the microprocessor, it is usually 
quite convenient to make changes in the system by 
changing this program. While it is customary when 
using these microprocessors to commit most of the 
program to read-only memory so that the program 
does not have to be loaded each time the system is 
turned on, it is still quite a bit more convenient 
to make changes by replacing or altering the 
contents of the read-only memory than it is to 
have to cut traces on printed circuit boards, 
move and add wires and back plane wiring, etc. 

It is also usually easier to provide for future 
expansion of a system when this expansion wil | 
take the form of enhancement or additions to a 


Reprinted from Digest of Papers, 1972: Sixth Annual {EEE Computer Society International Converence; copyright 1972 by the IEEE. 
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a program than it is when the expansion will in- 
volve changes of hardware. 


As in the case of any new component, the 
designer must learn to use the item efficiently. 
Because of the low cost of the processor itself, 
it becomes most desirable to minimize the cost of 
the peripheral circuitry around the processor. 

For example, in one application using the Intel 
MCS-4 single-chip computer family, it was neces- 
sary for the processor to be able to determine the 
value of an analog voltage. While it was possible 
to use the conventional approach of interfacing an 
analog to digital convertor to the microprocessor, 
a cost saving was achieved by having a micro- 
processor execute a program which enabled a digital 


to analog convertor and a comparator to perform the 


analog digital convertor function. Figure | 
shows how the conversion was achieved. The MCS-4 
uses a "port" for input/output communication. A 
four-wire port is associated with each read-only 
memory or read-write memory chip. (It should be 
noted that the MCS-4 microcomputer set is a family 
of four parts consisting of the 4004 central 
processor unit; 4001 read-only memory; 4002 
read-write memory; 4003 shift register/output 
expander.) In Figure |, two of these output ports 
have been used to drive the inputs of a digital 
to analog convertor (DAC). The DAC is wired to a 
comparator which allows the output of the DAC to 
be compared with the analog input signal. The 
output of the comparator is in turn wired to the 
test input of the 4004 central processor. This 
test input line is interrogated when the central 
processor executes a certain conditional jump 
instruction. Whereas the normal instruction 
execution flow within the MCS-4 system is sequen- 
tial through program memory, when the conditional 
jump is executed, the processor jumps to a new 
location in memory, starting a new instruction 
sequence. 


Figure 2 lists the program for the analog to 
digital convertor in MCS-4 assembly language (for 
those readers desiring more information about the 
instruction set of the MCS-4, they are referred 
to reference (I1)). The program implements a 
successive approximation conversion technique. 
Starting with the highest order bit, each bit in 
turn is turned on and the output of the comparator 
tested. If turning on the bit results in a signal 
from the DAC that is larger than the analog input, 
the bit is turned off and the next bit in turn 
tested. However, if turning on a bit leaves the 
output of the digital-to analog convertor stili 
smaller than the. analog input signal, then that 
bit will be left turned on. The coding for the 
program consists of testing each of the lines of 
one port in turn using in-line coding, then 
repeating the sequence for the next set of port 
lines by looping back. Setting a bit is 
accomp! ished by loading the accumulator with a 
load immediate instruction (LOM) and then writing 
the contents of the accumulator to the output 
port. The output port is selected at the begin - 
ning of the program by the combination of fetch 
immediate (FIM) and send register contro! (SRC) 
instructions. Register #4 (R4) ts used to contain 
the current estimate of the value for the 4-bits 
being tested. A bit under test is retained or 
cleared by updating or not updating the contents 


of register 4. At the end of the basic 4-1 ine 
test sequence of instructions, the contents of 
register 4 are saved in an alternate location by 
a series of exchange (XCH) instructions and the 
instruction increment and skip on zero (1SZ) is 
used to perform the function of counting the 
number of passes through the loop and jumping 
back to the loop start. The loop selects the 
next port in turn by the increment (INC) instruc- 
tion which modified registers RO so that when the 
next SRC instruction is executed, it wilf select 
the next port in sequence. This basic program 
can be easily modified to handle I2 bit btnary 
or 2 or 3 digit decimal conversions.* Execution 
of the sequence of instructions takes fess than 
one millisecond and as can be seen from the 
fisting, occupies some words of read-only 
memory. 


A multiplexer for multiple analog inputs 
can be added quite easily by providing a separate 
comparator for each analog input and performing _ 
digital multiplexing at the input to the test 
terminal of the 4004 central processor. An 
alternate use of the structure shown in Figure | 
permits determining which if any of the several 
signals is above or below some predetermined 
analog threshold value. The analog threshold 
value is deposited at the output ports driving 
the DAC and the outputs of the comparators are 
then read into the MCS-4 system at an input port 
or at the test terminal of the CPU, 


This example illustrates a typical appl i- 
cation for some of the new single-chip micropro- 
cessors, At the time of this writing, there are 
two such microprocessors available, the Intel : 
MCS-4 4-bit processor system and the Intel MCS-8 
8-bit processor system. Although both of these — 
processors are produced by one manufacturer, _ 
several other semiconductor manufacturers have 
announced their intention to produce micro- 
processor-like structures. With the continuing 
growth in the complexity of integrated circuits, 
the potential user of these devices can expect 
faster, more powerful microprocessors will 
become available in the future. However, two 
families of these processors are already ‘ 
available and are now finding wide acceptance. 

A designer who becomes familiar with their 
characteristics probably will find it easier to 


. adapt to new microprocessors and will. be in a 


better position to influence the design of the 
next generation of microprocessors, 


|, MCS-4 Microcomputer set -- Users Manual, 
Intel Corp., Mar 1972 


2. 8008 8&bit Parallel Central Processor Unit = 


MCS-8 Microcomputer set, Inte! Corp., 
_ April 1972 
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e DIGITAL MULTIPLEXER 


ANALOG ANALOG 
INPUT INPUT 


/SET UP FOR SELECTION OF ROM OUTPUT PORT (RO, RI=PO), 
USING RI /AS A LOOP COUNTER -- VALUES IN BINARY 
0000 00032 FIM PO 000011118 
00015 
/CLEAR REGISTERS R4, R5. (THESE TWO REGISTERS ARE 
/DESIGNATED PAIR 2 OR P2 BY THE FIM INSTRUCTION). 
R4 AND R5 /WILL BE USED TO RECEIVE THE RESULT OF THE 
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CONVERSION 
0002 00036 FIM P20 
00000 
/START OF MAIN LOOP 
0004 00033 ADLP, SRC PO /SELECT PORT USING CONTENTS OF RO, RI 
0005 00240 CLB /CLEAR ACCUMULATOR AND CARRY FLIP-FLOP 
0006 00216 LOM 8 /LOAD ACCUMULATOR WITH 1000 
/LDM 8 SETS THE HIGH ORDER BIT OF THE ACCUMULATOR 
0007 00226 WRR WRITE ACCUMULATOR TO ROM OUTPUT PORT 
0008 00025 JCN TI *+3 /SUMP PAST SCH IF RESULT TOO BIG 
00011 , 
0010 00180 XCH R4 /SAVE RESULT IF NOT TOO BIG 
/NOW REPEAT FOR 2ND HIGHEST BIT 
0011 00212 LOM 4 /LOAD ACCUMULATOR WITH 0100 
0012 00132 ADD R4 /ADD RESULT OF PREVIOUS TEST 
0013 00226 WRR WRITE TO ROM OUTPUT PORT 
0014 00025 JCN TI *4+3 /JUMP PAST XCH IF RESULT TOO BIG 
00017 
0016 00180 XCH R4 /SAVE CURRENT RESULT IF NOT TOO BIG 
/REPEAT PROCEDURE FOR LAST TWO BITS OF THIS PORT 
0017 11210 LDM 2 {LOAD ACCUMULATOR WITH 0010 
0018 00132 ADD R4 
0019 00226 WRR 
0020 00025 JCN Ti *4+3 
00023 
0022 00180 XCH R4 
0023 00209 LDM 1 /LOAD ACCUMULATOR WITH 0001 
0024 00132 ADD R4 
0025 00226 WRR 
0026 00025 JCN TI *+3 
00029 
0028 00180 XCH R4 
/NOW WRITE FINAL RESULT TO ROM PORT 
0029 00164 LD R4 /LOAD FINAL RESULT TO ACCUMULATOR 
0030 00226 WRR WRITE TO ROM OUTPUT PORT 


/NEXT MOVE THESE 4 BITS TO R5 AND CLEAR R4 AND CLEAR R4 FOR NEXT PASS 
/NOTE R5 INITIALLY CONTAINED ZERO 


0031 00181 
0032 00180 
0033 00096 
0034 00113 

00004 


XCH R5 
XCH R4 
INC RO 
iSZ Ri ADLP 


/ACCUMULATOR TO R5, R5 TO ACCUMULATOR 
ICLEARS R4 IF AT END OF FIRST PASS 
/PREPARE FOR SELECTION OF NEXT ROM PORT 
/RETURN FOR SECOND PASS AFTER PASS 1 


/AFTER PASS 2, PROGRAM CONTINUES PAST THIS POINT. HIGH ORDER 
/BITS OF RESULT WILL BE IN R4, LOW ORDER BITS IN R5. 


A MULTIMILLION BYTE CORE REPLACEMENT 
UTILIZING MOS DYNAMIC MEMORY 


T.J. DeFranco, H.F. Bodio, W.F. Jordan : 
Intel Corp. Santa Clara, California 


Reprinted from the [EEE Intercon Technical Papers, 1973. 


SUMMARY 


MOS memory arrays are being employed in some of 
the largest commercially available computers. This 
paper describes the hardware and system design of a 
million byte mainframe memory module for use as an add- 
on to a 370-155 or 165. Design features which exploit 
the computers error correction capability are presented. 
Included also is a description of the off-line self 
tester. 


Each module is available as a self-contained en- 
closure closely matching IBM construction and appear- 
ance, aS shown in Fig. 1. 


Fig. 1 


In-70 Memory System 


The in-70 can replace all or part of the IBM mem- 
ory up to one megabyte per cabinet enclosure, with ex- 
pansion of Main Memory Capability to 4 million bytes on 
both systems 370/155 and 165. 


The effective density of the in-70 memory system 
is four times that of IBM's 3360 processor storage 
(Fig. 2) and dissipates 50% less heat. Due to this 
dramatic miniaturization in size and power, signifi- 
cant cost savings can be experienced by the system 370/ 
155 and 370/165 user. 


60” 
7. 
/ | 
/ | 
in a l 
~ | / IBM 62” 
Sl 
31” 
IBM 3360 in-70 
Processor Monolithic Main 
Storage Memory 
Capacity/Cabinet: 1/2 megabyte 1 megabyte 
Floor Space: 13.3 sq. ft. 6.6 sq. feet 


FIG. 2 Comparison of IBM 3300 and in-70 Memories 


The electronic technology used in the 'in-70 mem- 
ory is intrinsically more reliable than electromagnetic 
core memory. AS a result the user will experience 
greater availability and improved cost performance from 
his total system. Reliability is further enhanced by 
utilizing the Error Correction Code electronics in the 
CPU. Single bit errors, the major source of previous 
core memory failures, are detected and automatically 
corrected. 


ARCHITECTURE 


The in-70 memory is organized with a single bit per 
memory card. The 370-155/165 processors communicate si- 
multaneously with four storage modules of 36 data lines 
each, so that a minimum of 144 memory boards, with addi- 
tional control cards, are required for the implementa - 
tion of this architecture. Two 6.75" x 8" x 0.4" memory 
cards have been developed, with 1 bit x 16K words and 1 
bit x 32K words, corresponding to memory expansion mod- 
ules of 1/4 megabyte and 1/2 megabyte, respectively. 

The basic in-70 module block diagram is shown in Fig. 3. 
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MEMORY ARRAY 


144 x 16k WORD [—— > ouTPuT [7 > 


or 32k WORD BUFFER 


SHORTING 
PLUGS 


BUFFER 


CONTROL, 15 ADDRESS LINES, BSMO-BSM3 SELECT 


Fig. 3. in-70 Block Diagram 


- MOS Storage Element 


The semiconductor storage device used in the mem- 
ory cards is the Intel 1103, a 1024 bit low threshold 
P-channel, dynamic silicon gate MOS LSI RAM (Random 
Access Memory). The 1103 is organized as a 32 row x 32 
column array, with five address bits selecting the row, 
and five address bits selecting the column. The 1103 
is packaged in an 18 pin dual-in-line package with 
three pins required for power, three pins for control 
Signals PRECHARGE, CENABLE, and READ/WRITE, and two — 
pins for DATA IN and DATA OUT. Each memory element is 


_comprised of three P-channel enhancement mode field 
(Fig. 4). 


effect transistors (FET) 


Fig. 4. Dynamic MOS Memory Cell 
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In the dynamic cell of Fig. 4 data is stored as 
charge on the parasitic capacitance Cs of the gate Q2. 
Data on the WDATA line is written into the cell by en- 
abling Q] with WSEL. To read from the cell, the RDATA 
line (with capacitance Cp) is discharged to Vpp (ena- 
bled by signal @). When the signal RSEL enables Q3, 
the RDATA line will be charged to Vss through inverter 
Q2 if and only if the capacitor Cs contains a low and 
will remain low if and only if Cy contains a high. 
Therefore, the RDATA line then contains the logical com- 
plement of the cell data. Although the read out opera- 
tion from the cell is non-destructive, the leakage asso- 
ciated with the junction of Q] eventually may result in 
the loss of the charge stored in Cg. To maintain the 
data stored in the cell, the data must be periodically 
regenerated. This regeneration is accomplished by read- 
ing the contents of the cell out onto the read data 
line, inverting and amplifying the signal and applying 
it to the WDATA line, and rewriting back into the cell 
by activating the WSEL line. A circuit which performs 
the inversion and amplification function is called a 
refresh amplifier. In the 1103, the dynamic cells are 
laid out in a two dimensional array. One entire row of 
cells is refreshed (or accessed) at one time, one re- 
fresh amplifier being provided for each column of cells 
in the array. To refresh the entire memory, each row 
of cells must be individually refreshed. A block dia- 
gram of the 1103 is shown in Fig. 5. 


AoO 

Pa MEMORY MATRIX 

fi 32 ROWS 
COLUMNS 

As SELECTOR AMPLIFIER i 1024 BITS) 


REFRESH AMPLIFIER 


Vss O-——_—> READ/WRITE COLUMN 
GATING 
Vec OO 
PRECHARGE O-——————> 
CENABLE O———_>- 


1 of 32 
COLUMN 
SELECT 


READWRITE Q———> 
O O 
Ag Ag 


Fig. 5 Block Diagram of 1103 


Five address lines, Ao through Aq are decoded to 
select one row of cells. When accessed, the contents 
of this row are transferred to a row of 32 refresh am- 
plifiers. In the course of a memory cycle, whether 
read or write, the data is regenerated and written back 
into the selected row of cells. Address bits A5 through 
Ag are decoded to select one refresh amplifier for com- 
munication with the data input and output terminals. 
Data output is sensed as a current. Activation of the 
"write" clock effectively disconnects the refresh am- 
plifiersoutputs from the write data lines and permits 
the signal on the data input line to override the signal 
at the output of the selected refresh amplifier. Figure 
6 shows the basic timing of the 1103 memory cycle. The 
cycle timing is established by the three clock signals: 
precharge, cenable, and write. Initially (prior to ex- 
ecution of a memory cycle) all clocks are at their high 


state, at a voltage approximately equal to Vss. To be- 
gin a cycle, precharge is first brought low, to approx- 
imately Vpp potential. Referring to Figures 5 and 6, 
this operation activates the row and column decoders, 


and also charges all read and write data lines negative- 


ly. In the discussion which follows, clocks, etc., are 
considered "on" at Vpp level, and "off" at Vss level. 


sone 
PRECHARGE 
CENABLE ) 
ee p 
READ MRITE = WRITE 
a a CoD 
ae 
DATA OUT / \ INVALID 
Fig. 6. 1103 Timing 


After precharge and addresses have been active 
long enough for the data lines to discharge and the de- 
coders to stabilize, the cenable clock is turned on, 
the decoded read-select line is turned on, the read- 
data line precharge circuits are turned off, and the 
data lines are charged to the complements of the data 
Stored in the selected row of cells. As the read-data 
lines are selectively charged, the precharge signal is 
turned off, removing the precharge signal from the 
write-data lines and closing the path which enables 
WSEL, restoring the contents of the memory cells. 

When the write line is activated, the read data 
lines are discharged, disconnecting the refresh ampli- 
fiers from the write data lines, and enabling a path 
from the data input line to the selected cell. The 
signal on the data input will then overwrite the con- 
tents of the cell. 

The non-destructive read-out capability of the 
1103 is utilized in the "Refresh Abort" mode of opera- 
tion of the in-70 system. If precharge and cenable 
are turned off simultaneously during a memory cycle 
interrupt, the Intel 1103 memory contents are not al- 
tered, no matter where in the memory cycle the inter- 
rupt occurs. In the 370-155/165 application it is nec- 
essary to stop (abort) any refresh cycle which may be 
in process when the computer demands a memory access. 
The subsequent period normally allocated for a magnetic 
memory restore cycle is used to service the 1103 re- 
fresh cycle at the previously interrupted row address. 


SYSTEM IMPLEMENTATION 


The processor interface timing is shown in Fig. 9. 
A memory cycle is initiated when a "BSM Select" line 
goes TRUE at time TO, which enables a 36 bit slice of 
the 144 bit data Bus. The address lines must be stable 
within 100 nanoseconds of TO, and stay stable until 
TO+1300 nanoseconds. Read Data is stable well within 
the specified access time of 800 nanosec, and write 
data is strobed into the memory at approximately TO + 
1200 nanoseconds, if write control is true (Fig. 7). 


ear aaea a eee 
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BSM SELECT [ ] Lf | | 


(| DATA STABLE | __) 


WRITE CONTROL J | | | 


(_|__ADDREsS sTaBLE |] 


WRITE DATA 


ADDRESS 


READ DATA 


800ns 
Fig. 7 Processor Interface 
Timing 


Each 1/2 megabyte memory module contains refresh 
logic which refreshes 1/128th of the memory at approxi- 
mately 10 microsecond intervals. | 

The refresh logic consists of a 7 bit counter, to 
sequentially address the 128 segments of memory, a tim- 
ing circuit, and a multiplexer to steer the row ad- 
dresses from the 7-bit counter to the memory chips. 
Standard READ cycle timing circuits are used to. execute 
a refresh cycle, which requires approximately 500 nano- 
seconds to complete. 

If a memory cycle is in process when the refresh 
initiate logic times out, the refresh logic will wait 
until the memory cycle is complete before the refresh 
cycle is initiated. If a memory access occurs during a 
refresh cycle, the refresh cycle is aborted, the stand- 
ard memory cycle is completed, and the refresh cycle is 
then restarted. Consequently, the through-put of the” 
processor is not degraded because of the refresh re- 
quirements of the dynamic semiconductor memory. Al] 
refresh operations occur when the processor is not ac- 
cessing memory, and the memory is never "BUSY" while a 
refresh cycle is in process. 


BASIC EXPANSION MODULE 


The basic expansion module of the in-70 is 1/2 
megabyte, consisting of 144 ea. 32K bit capacity memory 
cards. The 144 cards correspond one-to-one with the 
IBM 144 data and byte-parity lines in a standard IBM 
storage of four 36 bits/word parallel access modules. 
If expansion in 1/4 Megabyte (256K byte) increments is 
required, 1/2 populated boards (16K words x 1 bit) are 
provided. The bit-per card organization coupled with 
single-bit failure error correction capability allows a 
faulty memory card to be replaced on-line, without in- 
terrupting the processor. The land patterns on the 
edge power connections are recessed relative to signal 
connections in order to insure proper logical biasing 
when plugging cards in and out. An expansion module, 
ae power supplies, occupies 1/2 of a 31"x31"x60" cab- 

The basic 32K x word 1 bit Mu-37 memory card is 
shown in Fig. 8. The memory card is TTL compatible, 
and contains address decoders, clock driver, sense am- 


Plifiers, and control to read and writ 
3 e data at an 
memory word location. ? 
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Fig. 8 MU-37 Memory Card 


The CU-37 control cards generate the timing sig- 
nals required to read, write and refresh data in the 
MU-37 memory cards. Two CU-37 control cards are re- 

' quired for each 1/2 megabyte module. 


Buffer Cards 


Three types of buffer cards are used in the in-70; 
the BU-37 address and control buffer, the DI-37 write 
data buffer, and the D0-37 read data buffer. 

One BU-37 card is required for each 18 memory 
cards, so a total of 8 each are required for a basic 
expansion module. Four DI-37 and four D0-37 cards are 
required for each basic expansion module. 


Self-Test 


An off-line self-test module is available for the 
in-70 memory, which can rapidly locate faulty or margi- 
nal devices. The self-test module simpley attaches to 
the door of the in-70 cabinet, as shown in Fig. 9. 


Fig. 9 Self-Tester Packaging 
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The tester is electrically connected to the memory 
by removing the shorting plugs and inserting "paddle- 
card" connectors. A number of modes of operation are 
possible. 

The operation and versatility of the tester is 
best understood by describing the controls and indica- 
tors as shown in Fig. 10. 
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Fig. 10 Self-Tester Panel Layout 


1. Data Select - The two rotary 16-position "DATA- 
BITS" switches are hexadecimal encoded, and used to load 
each data byte in each selected module with the two 4- 
bit hexadecimal characters selected on the switches and 
parity. All four modules, BSMO through BSM3 and the 
"BUMP" memory are loaded if the individual BSM "ERROR 
CHECK/BYPASS" toggle switches are in the CHECK position, 
and not loaded if in the BYPASS position. 

2. Single Address Test - To test a single memory 
location, the desired address is set on the "ADDRESS 
FORCING" toggle switches, and the "MANUAL LOAD" toggle 
switch is set first to the "W/DATA" position, and then 
to the "R/DATA" position. The "DATA IN DISPLAY" and 
the "DATA OUT DISPLAY" will indicate the encoded data 
words selected on the DATA BITS switches. 

3. Dynamic Memory Test - To test all memory loca- 
tions, one or more of the three "MODE CONTROL" toggle 
switches (R/W, R/W, W/R) must be in the "AUTO" position. 
When the "CLOCK CONTROL" switch is set to the "RUN" po- 
sition, the memory will sequentially step through all 
modes except those with the "BYPASS" position set on the - 
"MODE CONTROL" switch. To single step through the mem- 
ory, the "CLOCK CONTROL" switch is momentarily set to 
the "STEP" position for each address increment. The 
dynamic memory test patterns and test sequences are se- 
lected to insure that no address or data lines are held 
low or high, and that no. address or data lines are open 
or shorted together without being detected. 

4. Fault Isolation - One of the major fault iso- 
lation aids is the ability to either "CHECK" or "BYPASS" 
any data bit or BSM, with the "ERROR DETECTION" and 
"ERROR" switches respectively. Any address bit may be 
set to "1" or "0" (ADDRESS FORCING) to address any de- 
sired segment of memory repetively. Two BNC outputs 
are available for sync inputs of oscilloscope-aided 
tests; i.e., "SCOPE SYNC" and "ERROR OUT". "SCOPE SYNC" 
is programmed by setting "ADDRESS BIT SYNC" and/or "MODE 
SYNC" switches to a "IT" or "0", which enables an output 
"AND" gate whenever programmed bits coincide with the 
switch settings. The "ERROR OUT" sync pulse occurs 
whenever an unmasked error is detected. 

5. Unique Features - The self-test unit has sev- 
eral features which are helpful in locating random in- 
termittant errors caused by marginal devices. The clock 
may be varied from the normal 2.0 to 2.1 usec by setting 
the "CLOCK RATE" switch to the "FAST" or "SLOW" position. 
Typical variations are 5% to 10%, although the tester 
may be adjusted for wider or narrower perturbations as 
desired. The power supplies may also be varied by set- 
ting the "VOLTAGE MARGIN" switches to the "+" or "-" 


position, which will vary the selected supply or sup- 
plies by + 5%. Perhaps the most valuable trouble- 
shooting aid, in combinations with the marginal clock 
and supply switches, are the "BURST" and "DISTURB" func- 
tions. When the "BURST" switch is "ON", the memory 
cycles are interrupted for an interval of time from 4 
to 50 milliseconds determined by the "BURST FREQ" po- 
tentiometer. During the "BURST" interval, if the "DIS- 
TURB" switch is "ON", although no BSM is selected, the 
address lines are sequenced through addresses unrelated 
to the memory address at the time of the "BURST" inter- 
rupt. 


RELIABILITY AND MAINTAINABILITY 


In addition to the economics of lower inital cost 
and reduced size, power, and weight, the. in-70 has the 
additional advantages of increased reliability and main- 
tainability. 

The improvement in reliability may be attributed 
to several factors: 

1. Reduced number of interconnections: The major 
savings in interconnections is in the 1103 memory chip, 
with 1024 bits stored in a single 18 pin package, and 
addressed by a common 10 address lines. Partially de- 
coding 5 higher order address lines to select one of 32 
devices per memory card in an 8 x 4 array further re- 
duces interconnections. 

2. Improved Noise Environment: All MOS memory 
devices are buffered with TTL compatible interface de- 
vices such that there is no mixing of low level sense 
lines, TTL lines, and high-level MOS clock drivers on 
the memory back panel. On the memory card itself, the 
high level (16V) MOS clocks are on the opposite side of 
the board and at right angles to the low level (400mv) 
sense line outputs to minimize coupling. Only four 
1103 device outputs are common to a single sense line 
prior to buffering and translating the output to the 
TTL backpanel level. However, these four sense lines 
can access 4096 bits. 

~3. Silicon Gate Technology: The silicon gate 
technology eliminates the majority of MOS surface- 
related failure effects, since the gate is not only 
buried in protective dielectric but is also protected 
by an overall oxide barrier, no contamination from the 
external environment can introduce surface changes. In 
other MOS technologies contamination has caused in- 
creases ,in threshold voltages and leakage currents, or 
field inversion effects, depending on the location and 
nature of the contamination. 

, 4. System Burn=In: Because of the large number 
of semiconductor devices in the memory, the small per- 
centage of infant mortality failures inherent in any 
semiconductor device can cause unsatisfactory perform- 
ance if allowed to occur in the field. Consequently, 
each system is operated at elevated temperatures (typi- 
cally 50°C to 70°C depending on application) under dy- 
_namic conditions for a minimum of twelve hours. Failed 
devices are replaced with devices which have been burned 
in to the same or more stringent criteria than the mem- 
ory system. 

5. System Qualification: Final acceptance test- 
ing of the burned-in memory system is at elevated tem- 
perature and with # 5% variations in the Vo. and Ve¢ 
supplies. These are the conditions most Tikely to re- 
‘sult in "dropping a bit." If an error is detected, the 
marginal device is replaced, and the testing repeated 
until no errors occur. 

6. Power Reduction: Since the life of semicon- 
ductor devices correlates inversely with respect to 
power dissipation and heat, a significantly longer life 
‘May be expected relative to an otherwise comparable mag- 
netic or bipolar memory. 


“MAINTAINABILITY FACTORS 


On-Line Repair | 

The single error correction capability of the IBM 
370-155/165 processor is fully exploited by the in-70 
architecture. When a hard error occurs, the card is 
readily identified by the bit number of the failure and 
BSM address, and may be replaced on-line, with the miss- 
ing bit supplied in the interim by the processor Error 
Correction Code (ECC) and ECC logic. 


Rapid Off-Line Configuration and Off-Line Test 


All input and output lines between the memory and 
the processor are wired through shorting plugs.» Re- 
moving eight of these plugs interrupts all signals to 
a 1/2 megabyte memory module. The memory self-test 
unit may then be plugged in for rapid fault isolation 
in an off-line mode, as described previously. 


Memory Architecture and Internal Buffering 


Referring again to Fig. 3, the memory module is 
fully buffered at the processor interface, and again at 
each memory card. Consequently, the memory system back- 
panel provides an excellent intermediate starting point 
for troubleshooting and fault isolation. A line held © 
low or high can be quickly found by removing the memory 
or control cards on the bus. Since the memory sub- 
assemblies are very orderly in their implementation, 
the modular structure of the memory permits the tech- 
nique of transposition of memory cards for rapid fault 
location. Back panel wiring of address coded card po- 
sitions allows double bit errors (detected by CPU diag- 
nostics) to be translated into independent single bit | 
errors. 


CONCLUSION 


The semiconductor memory system described in this 
paper is not only more reliable than core memories of 
comparable size, but much easier to troubleshoot and 
repair when failures or data degradation does occur. 
Single bit memory errors can be repaired on-line, and 
fault-isdlation of marginal conditions can be accom- 
plished off-line during scheduled main-frame down time. 

Refresh operations are accomplished during normal 
machine cycles without interfering with memory access. 
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PRODUCT RELIABILITY: 1601 


INTRODUCTION 


The 1601 is a fully decoded 2048-bit electrically 
programmable MOS read-only memory whose basic 
nonvolatile memory element is a novel MOS charge- 
storage device!. In this bulletin, the results of relia- 
bility studies on this element and the 1601 itself are 
given. It is shown in the course of high temperature 
and high voltage lifetesting there have been 


~2,250,000 unit-hours 


O failures in ~ 100,000,000 bit-hours 


FAILURE MODES 


When a 1601 memory bit is in the ‘‘O’’ state, 


charge is stored in the MOS charge-storage device; . 


in the ‘‘1’’ state, no charge is stored. The device is 
fabricated using Intel’s standard p-channel silicon- 
gate technology? of proven reliability. Thus, the only 
reliability questions posed by use of this new MOS 
~ charge-storage device are 

How stable is the ‘0’ memory state? 

How stable is the ‘‘1’’ memory state? 


LIFETEST RESULTS 


To investigate the stability of the ‘‘O’’ state, 
discrete devices, 16-bit memory elements, and 
1601’s have been subjected to high temperature 


No. of 
Unit Temp. Units 


125°C 
125°C 
125°C 


Discrete device 
Discrete device 
16-bit Memory 


1601 200°C 
1601 125°C 
1601 125°C 


FEBRUARY 1971 


storage which is the worst case condition for this 
state!. The results obtained to date are given in 
Table 1. 


O FAILURES IN MORE THAN 1,500,000 UNIT-HOURS 


Time-to-failure versus temperature is shown in Fig. 1, 
where failure is taken as a 30% change in the logic 
level. It is evident from this figure that for the maxi- 
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Figure 1: Time-to-Failure Versus Temperature 


No. of No. of 
Hours Unit-Hours 


5215 777,035 
3835 383,500 
3835 383,500 
500 5,000 
2415 12,075 
1440 7,200 
1,568,310 


No. of 
Failures 


TABLE |: Summary of 125°C and 200°C Storage Lifetest results. 


mum operating temperature the time- to- failure ex- 
trapolates to a value well in excess of 500 years. 


The worst case condition for the “1 state is when 
voltage is applied to the charge-storage device. Table 
Il summarizes the results obtained to date. 


0 FAILURES IN ~650,000 UNIT-HOURS 


Time-to-failure versus applied voltage is shown in 
Fig. 2, based upon experiments performed at voltages 
well in excess of the maximum allowed voltage. At 
the maximum allowed voltage a time-to-failure well in 
excess of 100 years is indicated. 


FAILURE 
(HOURS) 


104 MAXIMUM 
1601 OPERATING 
OLTAGE 


(YEARS) 


5 10 15 20 25 30 
APPLIED VOLTAGE (VOLTS) 


Figure 2: Time-to-Failure Versus Applied Voltage 


No. of 
Unit ; Units 


Discrete device 10 
12-bit Memory 102 


1601 10 
1601 
1601 


_ CONCLUSION 


The 1601 does not involve any changes to Intel’s, 
standard p-channel silicon-gate technology. However, 
it does involve a novel charge-storage device. The, 
two possible failure modes associated with this tran-. 
sistor are 


loss of ‘‘0’’ level 
loss of ‘‘1”’ level 


In this bulletin, results of lifetests designed for the. 
worst case conditions for the above failure modes . . “ 
high temperature for loss of ‘‘O”’ level, and high vott-: 
age for loss of ‘‘1”’ level . . . are given. In all there. 
have been 0 failures in almost 100 million bit-hours. 
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No. of 
Hours 


3,835 
6,000 


No. of 
Unit-Hours 


38,350 
612,000 
2,000 20,000 
1,000 4,000 

500 3,000 
677,350 


No. of 
Failures 


TABLE Il: Summary of applied bias Lifetest results. 
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RELIABILITY REPORT- 
1103 SILICON GATE MOS LSI RAM 


1. INTRODUCTION 


The Intel 1103, 1024 bit dynamic, P channel 
silicon gate MOS random access memory has 
become one of the most important semiconductor 
components to be introduced in many years. 
Offering significant advantages over cores in cost, 
system flexibility and performance, this product 
is fast becoming the industry standard. In addition 
to all of the design features, the 1103 is also an 
extremely reliable device. This report was prepared 
to present the data to substantiate the reliability of 
the 1103. 


To date, based on the equivalent of 
8,663,000 unit hours of system lifetests, 
a best estimate failure rate of 0.008% per 
1000 hours at 55°C has been calculated 
for the 1103. 


ll, ELECTRICAL TESTING 


A. Lifetesting 

In order to assure the electrical reliability of 
the 1103 die itself, the following three categor- 
ies of lifetests were carried out.') 


1. System Lifetests 
a. Continuous System Lifetests 


Four operating 1103 system lifetests, each 
consisting of a 4096 word x 18 bit memory 
board are presently on lifetest. Each system con- 
sists of a memory board, with 72 plastic 1103's, 
assorted drivers, refresh control and clock gen- 
erator, and an exerciser board which generates 
the addressing, data generation, and error de- 
tection. The exerciser board generates an essen- 
tially random data pattern of 13 bits plus 5 
parity bits. The read or write mode is also con- 
trolled randomly. The output from the memory 


HOURS AT 


UNITS Nov. 22, 1971 


EQUIVALENT 


FEBRUARY 1973 


is read through the parity tree and errors are 
decoded to identify a failed unit. 


Table | summarizes the system lifetest results 
to date. 


All systems are operating at Vg, = +16 volts, 
Ven - Vsg = +3.5 volts, with a 650 nsec cycle 
time. Periodically the system clocking is slowed 
to verify the 2 msec refresh time. Figure 1 


shows the voltage margin plot on System #2 
initially and at present. 


b. Rotating System Lifetest 


As a means of assuring the continuing 
reliability of the 1103, a fifth system is in 


6 

® 
@ 
® 
@ 


@® AREAOF DATA 
SHEET GUARANTEED 
OPERATION 


Vss — Voo 


@ INITIAL READING 
@ READING AFTER LIFETEST 
ON SYSTEM 2 (SEE TABLE 2) 


Ves - Vss 


Figure 1. 1103 System Voltage Margin Plot @ Ta = 25°C 


FAILURES (2) 
(DC, AC, FUNCTIONALITY) 
CATASTROPHIC DEGREDATION 


UNIT HRS. 
@ 55°C (1eV) 


SYSTEM #1 72 8000 
SYSTEM #2 72 1000 
72 1110 
SYSTEM #3 72 1250 
72 1100 
SYSTEM #4 72 500 


5,750,000 0 
72,000 
799,000 
90,000 
792,000 
360,000 


7,763,000 
Table 1. 


(1) All data was taken with plastic packages with the exception of several groups of devices on HTRB which were ceramic. 


(2)in system operation functionality (continuous monitoring), refresh and system voltage margin are routinely checked. 


EQUIVALENT FAILURES'2) - 
UNIT HRS. (DC, AC, FUNCTIONALITY) 
UNITS HOURS @55°C(1.0eV) CATASTROPHIC DEGREDATION 


Oo 


120,000 
120,000 
120,000 
120,000 
120,000 
120,000 
120,000 
120,000 
120,000 
120,000 


oo o0coocmUcrmWCUCcamWdlUcUODlU OUD 
o.oo 08000 + © 


Table 2. 


Operation on a rotating system basis. The 
system is also a 4096 word by 18 bit operating 
as above. Twelve units are added weekly and 
allowed to remain for 6 weeks or 1000 hours. 
The initial results as of Work Week #47 are 
reported in Table 2. The one degredation failure 
was an increase in I 5, current. 


c. Summary of System Lifetests 


The data in Tables 1 and 2 represents 
the equivalent of 8,663,000 unit hours 
of system operation giving a best estim- 
ate failure rate of 0.008% per 1000 hours 
at 55°C. 


2. HTRB (High Temperature Reverse Bias) 


A second category of lifetesting, High Tem- 
perature Reverse Bias, consists of DC biasing Figure 2, Bias Configuration for Intel 1103 HTRB 
the chip for extended periods of time at . 
elevated temperature. It is the most common the past 6 months of production material. The 
form of lifetesting and is used to detect the one degredation failure was an increase in input 
presence of thermally activated failure modes. leakage current. 
Figure 2 shows the bias configuration used 
for the 1103 HTRB lifetests. 3. Storage 


In Table 3 we present HTRB data on the A third common lifetest is extended high 
1103. The units reported cover samples from temperature storage. This lifetest is generally 


EQUIVALENT FAILURES '2) 
UNIT HRS. (DC, AC, FUNCTIONALITY) 
a. UNITS HOURS @55°C (1eV) CATASTROPHIC DEGREDATION 
HTRB 70°C 28 7350 2,060,000 0 ) 
HTRB 125°C 20 1000 2,000,000 a) ) 
HTRB') = 125°C 27 6300 17,000,000 fy) ) 
HTRB'') = 125°C 105 1000 10,500,000 0 1 


Table 3. 


(1) Ceramic package. 
(2)in system operation functionality (continuous monitoring), refresh and system voltage margin are routinely checked. 
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EQUIVALENT FAILURES 


UNITS HOURS 


Storage 
Storage 


Storage '1) 


(1)Ceramic package. 


aimed at mechanical reliability and process 
stability. 


Table 4 summarizes storage data on the 
1103. 


B. End-of-Life Data 

Virtually all active semiconductors tend to 
undergo some change in their basic parameters 
during life; however, the basic reliability of the 
silicon gate process has been well established 
through numerous lifetests (a). For example, 
ionic contamination can result in degradation 
of p-n junction characteristics which will be 
evidenced by increased input leakage or major 
changes in refresh times. Likewise, surface 
charging can lead to leakage increases and loss 


1011 


1010 


TIME (HOURS) 


109 


10-5 
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300 250 160 125 70 35 25 


JUNCTION TEMPERATURE (°C) 


Figure 3. Plot of Time vs Temperature for Threshold Drift 


UNIT HRS. (DC, AC, FUNCTIONALITY) 

@ 55°C (1eV) CATASTROPHIC DEGREDATION 
1,050,000 

17,700,000 

34,000,000 

Table 4, 


of isolation between nodes. The effects of 
surface charging or ionic contamination are 
generally catastrophic to the operation of a 
dynamic array and are thus readily detectable. 
The primary effect on end-of-life data comes 
from the positive charge accumulation at-the 
oxide-silicon interface. This inherent phenome- 
non results in slight increases in device threshold 
voltages. Its effect on MOS devices and arrays 
has been well documented. (a, b, c, d, e) 


These charge migration failure modes are 
thermally activated; that is, one can trade time 
for temperature. Figure 3 shows a plot of time 
vs temperature for threshold drift (1eV) and 
ion migration (1.4 eV) activation energies. Note 
that 1000 hours at 125°C is equivalent to 
100,000 or 1,000,000 hours at 55°C, depending 
on the failure mechanism. 


A group of 20 devices was subjected to a 
1000 hour HTRB lifetest. The units were on the 
static bias configuration illustrated in Figure 2. 
They were connected in parallel and stressed 
at 125°C. Table 5 presents the initial and final 
average leakage current readings by pin as well 
as deviations and worst case values. 


Table 6 shows the initial and final power 
supply current values as well as standard devi- 
ation and worst case values. 


The results show that both power drain and 
leakage currents are stable with only one pin 
showing an increase to 130 nA (spec. limit is 
1 pA) after 1000 hours. 


The output current or Io), results for the life- 
test are included in Table 6 and prove to be 
stable. 


All readings were taken with a Teradyne J259 
system. 


C. Refresh 


The basic operation of the 1103 depends on 
the retention of charge on a storage node for 
a certain length of time. Figure 4 shows the 
cell structure indicating the location of the 
storage node. 

The primary charge loss mechanism from the 
storage node is the leakage current of the p-n 
junction associated with that node. The length 
of time the charge can be retained at a useful 


INITIAL (in nanoamps) 


STD DEV 


INITIAL (in milliamps) 


AVG STD DEV MAX 
36.6 
40.7 
6.6 
2.15 


1.09 (Min.) 


max 


STDDEV  — MAX __|(in nanoamps) 


FINAL (in milliamps) . 


MAX A 
STD DEV MAX (in milliamps) 


AVG 


36.2 
40.2 
6.5 
2.16 


1.08 (Min.) 


Table 6. Power Supply Current and lon Readings Before and After 1000 Hour HTRB 


level is termed the refresh time. Figures 5 and 
6 show the refresh times for the first fail and 
second fail bits before and after a 50 hour 
160°C (equivalent to 1000 hours at 125°C) 
system burn-in. 


D. Voltage Margin 

In the operation of the 1103, two supply 
voltages are involved: V,,, the voltage to 
Operate the device and V,,, the substrate 
potential. Functionality is guaranteed for Von 
+5% and Vep = 3 to 4 volts. 


The range of supply voltages for operation 
is defined as the voltage margins of the device. 
Rather than attempting to present individual 
devices, the voltage margin plot of a 4096 word 
by 8 bit system was presented in Figure 1. The 
final reading is after 1000 hours of operation at 
55°C plus 1000 hours at 70°C. The system plot 


Figure 4. Basic RAM Dynamic Cell 
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+5% MEASUREMENT ACCURACY 


Trer FINAL (MSEC) 


Ta = 25°C 
60 SEC WARMUP 


0 50 100 150 200 250 


Treg INITIAL (MSEC ) 


Figure 5. First Fail 


defines the worst case device at each measure- 
ment point. Other system tests and discrete de- 
vices also show the same stability. 


il, PLASTIC MECHANICAL 
A. Moisture Failure Modes 

One of the primary functions of an integrated 
circuit package is to prevent moisture penetration 
to the die itself. Such penetration leads to eventual 
‘corrosion of the aluminum metallization of the 
array. In the case of packages with internal cavities, 
‘screens such as Fine and Gross Leak tests have 
been developed to demonstrate a hermetic seal 
which guarantees against moisture penetration. 
Unfortunately, non-destructive screens as simple 
as these have not been developed for plastic 
packages. The test that is most popular is the 
so-called ‘‘pressure cooker’’test, i.e., exposure to 
30psia steam. For example, one large computer 
manufacturer who relies very heavily on plastic 
encapsulated devices, specifies that the devices 
must be capable of surviving 6 hours of exposure 
to 30psia steam. Intel’s standard silicone lot 
qualification also includes 3Opsia steam. testing. 
Lots failing the initial qualification or subsequent 
monitors are rejected and new qualified material 
is put into production. 

1. Autoradiographic Analysis: 0 Fail/10 Devices 


A procedure has recently become available 
which detects minute amounts of moisture 
penetration - below the levels necessary to 
cause corrosion - as well as the entry path of 
moisture. The technique involves the use of 
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Figure 6. Second Fail 


radioactive Zn as a tracer in an aqueous 
solution. After exposure to this environment at 
30psia, the units are sanded to remove excess 
plastic and placed on X-ray film. Moisture paths 
into the chip will then show on the film. Figure 
7 shows two such film plates from a group of 
ten 1103’s which were subjected to 24 hours 
exposure at 30psia to the aqueous solution. 
The autoradiogram clearly shows no evidence 
of moisture penetration. This was true for all 
10 devices. 


Figure 7. Results of Autoradiographic Analysis on 2 1103's 


2. Mil Std 883 Method 1004: 0 Fail/50 Devices 


A commonly utilized test, taking somewhat 
longer than the pressure cooker, is the Method 
1004 of Mil Std 883. This moisture test involves 
10 cycles of 1 day each at various temperature/ 
humidity conditions with bias. Figure 8 outlines 
one such cycle. 
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Figure 8. Graphical Representation of Moisture- Resistance Test-MIL STD-883, Method 1004-2 eta 


A group of 50 1103's representing six fab- 
rication lots was subjected to the Method 1004 
test and resulted in no functional, AC, DC, or 
mechanical failures. 


B. Bond-Related Failure Modes 
Bond-failures - opens and thermal inter- 
mittence -- can result because of the differences 


in thermal expansion coefficients between the - 


plastic, the leads, and lead/frame/die system. 
These differences set up stresses as shown 
schematically in Figure 9. The magnitude of 
these stresses depends on the design of the lead 
frame, the type of lead bonding employed, and 
the type of plastic and the conditions under 
which it was molded and cured. 
1. Temp Cycling (—55°C to 150°C): 1 Fail/ 

26000 Unit Cycles 

Intel uses temperature cycling as a con- 
tinuing process monitor on all products encap- 
sulated in plastic. Units are selected from 
current production lots and are subjected to 
temperature extremes of —55°C to 150°C. 
For example, during the past 3 months, 260 
P1103’s from 26 production lots were each 


Figure 9. Schematic Representation of Stresses Set Up in a Piste 
Package 


subjected to 100 temperature cycles with only 
one bond failure. 


2. Thermal Shock (—65°C to +125°C): 0 Fail/ 
6250 Unit Cycles 
Another commonly utilized test of band | 
integrity is thermal shock. The units are placed 
in a 125°C bath of FC40, soaked 5 minutes, 
transferred (less than 10 seconds) to an alcohol- 


Group A 25°C - 110°C 
Group B 25°C - 110°C 


FAILURES 
HOURS CYCLES (FUNCTIONAL, AC, DC, 
CONTINUITY OVER TEMP.) 
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Table 7. 


dry ice mixture, and soaked 5 minutes. A group 
of 25 P1103’s was subjected to 250 such 
cycles. The test resulted in no failures. 


3. Power Cycling (T, - 25°C to 110°C):0 Fail/ 
2,112,000 Unit Cycles 


A less severe but more realistic test of bond 
integrity is power cycling, where the unit is 
alternately powered and cooled. The bias vol- 
tages are adjusted to bring the chip to about 
110°C within 2.5 minutes, and then a fan is 
used to return the unit to room temperature in 
the next 2.5 minutes. Table 7 shows the results 
of extended power cycling for two groups of 
1103's. 


4. Storage (125°C -0 Fail/790,000 Unit Hours; 
160°C - 0 Fail/390,000 Unit Hours) 

Another test of bond integrity is extended 
high temperature storage. 243 non-functional 
units were stored at elevated temperature as 
summarized in Table 8. Note that only con- 
tinuity of bonds over the range O°C to 90°C 
was measured at each readout. 


FAILURES 
Ty UNITS HOURS (CONTINUITY) 


125°C 158 5000 
160°C 78 5000 


Table 8. 
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Selection Guide 


Random Access Memories 


Electrical Characteristics over Temperature 


Power 
Dissipation* 


SILICON 


Dynamic Fully Decoded 
Static Fully Decoded 


Hi-speed Fully Decoded 
(With 3-state Output) 


Hi-speed Fully Decoded 
(With Open Collector Output) 


Hi-speed Fully Decoded 
(With Open Collector Output) 


SCHOTTKY BIPOLAR 


‘Content Addressable Memory 


Power 
Type i Dissipation* 
‘ Max. 


2048 Electrically Programmable (Static} 
1702A 2048 Erasable Electrically Programmable 
(Static) 


oe | me Mask Programmable (Static) 


| 3301A [1024 | High Speed, Mask Programmable 


3304 4096 High Speed, High Density 
3601 1024 High Speed Electrically 
Programmable 
No. 
Type of Description Dissipation* 
Bits Max. 


SILICON GATE MOS 
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SCHOTTKY 


BIPOLAR 
PROM 


1024 Quad 256 Bit Dynamic 
1024 Dual 512 Bit Dynamic 

1404A 1024 1024 Bit Dynamic 
14054 512 Dynamic Recirculating 
Dual 100 Bit Dynamic 
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**The 1506 and 1507 are also available in military temperature range (~—55° to +125°). To order specify 1406 or 1407, respectively, 


SILICON GATE MOS 


Memory Peripherals 


Electrical Characteristics over Temp. ; 
Description Supplies [V] 


Quad Bipolar to MOS Level Shifter and Driver 
High Speed 6 Bit Latch 
3408A Hex Sense Amp and Latch for MOS Memories 


*Power Dissipation calculated with maximum power supply current and nominal power supply voltages. 


SCHOTTKY 
BIPOLAR 


WEST 


ARIZONA 
Hamilton/Avnet Electronics 
2615 South 21st Street 
602/275-7851 
Phoenix 85034 
Cramer/Arizona 
2816 N. 16th Street 
602/263-1112 
Phoenix 85006 


CALIFORNIA 
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340 E. Middlefield Road 
415/961-7000 
Mountain View 94041 
Cramer/San Francisco 
720 Palomar Ave. 
408/739-3011 
Sunnyvale 94086 
Hamilton Electro Sales 
10912 W. Washington Blvd. 
213/870-7171 
Culver City 90230 
Cramer/Los Angeles 
17201 Daimler Street 
714/979-3000 
Irvine 92705 
Hamilton/Avnet Etectronics 


U. S. DISTRIBUTORS 


MID-AMERICA 


ILLINOIS 


Cramer/Chicago 

1911 South Busse Road 
312/593-8230 

Mt. Prospect 60056 
Hamitton/Avnet Electronics 
3901 North 25th Avenue 
312/678-6310 

Schiller Park 60176 


KANSAS 
Hamilton/Avnet Electronics 
37 Lenexa Industrial Center 
913/888-8900 
Lenexa 66215 


MICHIGAN 
Sheridan Sates Co. 
33708 Grand River Avenue 
313/477-3800 
Farmington 48204 
Cramer/Detroit 
13193 Wayne Road 
313/425-7000 
Livonia 48150 
Hamilton/Avnet Electronics 
12870 Farmington Road 
313/522-4700 


OHIO 


Cramer/Tri-States, Inc. 
666 Redna Terrace 
513/771-6441 
Cincinnati 45215 
Sheridan Sales Co. 
10 Knotlcrest Drive 
§13/761-5432 
Cincinnati 45237 
Cramer/Cleveiand 
§835 Harper Road 
216/248-7740 
Cleveland 44139 
Sheridan Sales Co. 
7800 Wall Street 
216/524-8120 
Cleveland 44125 
Sheridan Sales Co. 
Shiloh Bidg., Suite 250 
5045 North Main St. 
513/277-8911 
Dayton 45405 


TEXAS 
Cramer Electronics 
2970 Blystone 
214/350-1355 
Dallas 75220 


8817 Complex Drive 


Livonia 48150 


Hamilton/Avnet Electronics 


NORTHEAST 


CONNECTICUT 


Hamilton/Avnet Electronics 
643 Danbury Road 
203/762-0361 

Georgetown 06829 
Cramer/Connecticut 

35 Dodge Avenue 
203/239-5641 

North Haven 06473 


MARYLAND 


Cramer/EW Baltimore 
922-24 Patapsco Avenue 
301/354-0100 
Baltimore 21230 
Cramer/EW Washington 
16021 (ndustrial Drive 
301/948-0110 
Gaithersburg 20760 
Hamilton/Avnet Electronics 
7255 Standard Drive 
301/796-5000 

Hanover 20176 


MASSACHUSETS 


Cramer Electronics, {nc. 
85 Wells Avenue 
617/969-7700 

Newton 02159 


Hamiiton/Avnet Electronics 


SOUTHEAST 


ALABAMA 


Cramer/EW Huntsville, tnrc. 
2310 Bob Wallace Avenue 
205/539-5722 

Huntsvilte 35805 


FLORIDA 


Cramer/EW Hollywood 
4035 North 29th Avenue 
305/923-8181 

Hollywood 33020 
Hamilton/Avnet Electronics 
4020 North 29th Avenue 
305/925-5401 

Hollywood 33021 
Cramer/EW Orlando 

345 North Graham Avenue 
305/894-1511 


Orlando 32814 
GEORGIA 


Cramer/EW Atlanta 
3923 Oakcliff Industrial Court 
404/448-9050 


Atlanta 30340 


Hamilton/Avnet Electronics 
6700 Interstate 85 Access Road 
404/448-0800 

Norcross 30071 


Distributors 


MINNESOTA NORTH CAROLINA 


714/279-2421 4445 Sigma Road 


San Diego 92123 
Cramer/San Diego 
8975 Complex Drive 
714/565-1881 

San Diego 92123 


COLORADO 
Cramer/Denver 
5465 E. Evans Place at Hudson 
303/758-2100 
Denver 80222 


Hamilton/Avnet Electronics 
5921 N, Broadway 
303/534-1212 

Denver 80216 


NEW MEXICO 
Cramer/New Mexico 
137 Vermont, N.E. 
505/265-5767 
Albuquerque 87108 


OREGON 


Almac/Stroum Electronics 
8888 S.W. Canyon Road 
503/292-3534 

Portland 97225 


UTAH 


Cramer/Utah 

391 W. 2500 South 
801/487-3681 

Salt Lake City 84115 
Hamilton/Avnet Electronics 
647 W. Biltinis Road 
801/262-8451 

Salt Lake City 84115 


WASHINGTON 
Hamilton/Avnet Electronics 
13407 Northrup Way 
206/746-8750 
Bellevue 98005 
Almac/Stroum Electronics 
§811 Sixth Avenue South 
206/763-2300 
Seattle 98108 
Cramer /Seattle 
5602 6th Avenue South 
206/762-5755 
Seattle 98108 


Cramer/Bonn 

7275 Bush Lake Road 
612/941-4860 

Edina 55435 
Hamilton/Avnet Electronics 
2850 Metro Drive 
612/854-4800 

Minneapolis 55420 
tndustrial Components, !nc. 
5280 West 74th Street 
612/831-2666 

Minneapolis 55435 


MISSOURI 
Sheridan Sales Co. 
110 South Highway 140, Suite 10 
314/837-5200 
Florissant 63033 
Hamiliton/Avnet Electronics 
392 Brookes Drive 
314/731-1144 
Hazelwood 63042 


214/661-8661 
Daltas 75240 


Hamitton/Avnet Electronics 


1216 West Clay 
713/526-4661 
Houston 77019 


WISCONSIN 
Cramer/Wisconsin 
5626 N. 91st Street 
414/462-8300 
Mitwaukee 53225 


185 Cambridge Street 
617/273-2120 
Burlington 01803 


NEW JERSEY 


Hamilton Electro Sales 
218 Little Falls Road 
201/239-0800 

Cedar Grove 07009 
Cramer/New Jersey 

No. 1 Barrett Avenue 
201/935-5600 
Moonachie 07074 
Hamilton/Avnet Electronics 
113 Gaither Drive 

East Gate Industrial Park 
609/234-2133 

Mt. Laurel 08057 
Cramer/Pennsylvania, Inc. 
7300 Route 130 North 
609/662-5061 
Pennsauken 08110 


NEW YORK 


Cramer/Binghamton 
3220 Watson Boulevard 
607/754-6661 

Endwell 13760 


Cramer/Rochester 

3000 Winton Road South 
716/275-0300 

Rochester 14623 
Cramer/Syracuse 

6716 Joy Road 
315/437-6671 

East Syracuse 13057 
Hamilton/Avnet Electronics 
6400 Joy Road 
315/437-2642 

Syracuse 13057 
Cramer/Long {sland 

29 Oser Avenue 
516/231-5600 

Hauppauge, L.I. 11787 
Hamilton/Avnet Electronics 
70 State Street 
516/333-5800 

Westbury, L.!. 11590 


PENNSYLVANIA 


Sheridan Sales Co. 
4268 North Pike 
North Pike Pavilion 
412/373-1070 
Monroeville 15146 


Cramer Electronics 
938 Burke Street 
919/725-8711 


Winston-Salem 27102 


CANADA 


BRITISH COLUMBIA 


L.A. VARAH Ltd. 
2077 Alberta Street 
604/873-3211 
Vancouver 10 


ONTARIO 


Cramer/Canada 

920 Ainess Avenue, Unit No. 9 
Downsview 

416/661-9222 

Toronto 392 
Hamilton/Avnet Electronics 
6291 Dormain Rd. No. 19 
416/677-7432 

Mississauga 

Hamiiton/Avnet Electronics 
880 Lady Ellen Place ; 
613/725-3071 

Ottawa 


QUEBEC 


Hamitton/Avnet Electronics 
935 Monte De Liesse 
514/735-6393 

St. Laurent, Montreal 377 
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Several significant new products were announced by Intel in the third quarter of 1973. The announced products 
included RAMs which ranged in size from 1024 to 4096 bits. The 1024 bit RAMs were the 1103A and 2105. 
The 1103A is electrically pin compatible to the 1103, however, it does not require a precharge clock. The 2105 
is a high speed N-channel dynamic RAM with an access time less than 100 nsec. A low cost, high bit density 
RAM (4096 bits), the 2107, was announced in July. Like the 2105, the 2107 is fabricated using N- channel 
silicon gate technology. 


The above devices, along with their peripheral circuits, will be some of the new devices discussed in the wpb 
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